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Preface 



The Colloquium on Structural Information and Communication Complexity 
(SIROCCO) is an annual meeting focused on the relationship between comput- 
ing and communication. Over its 11 years of existence, SIROCCO has gained a 
considerable respect and has become an acknowledged forum bringing together 
specialists interested in the fundamental principles underlying all computing 
through communication. 

SIROCCO 2004 was the 11th in this series, held in Smolenice Castle, June 
21-23, 2004. Previous SIROCCO colloquia took place in Ottawa (1994), Olympia 
(1995), Siena (1996), Ascona (1997), Amalfi (1998), Lacanau-Ocean (1999), 
L’Aquila (2000), Val de Nuria (2001), Andros (2002), and Umea(2003). The 
colloquium in 2004 was special in the respect that, for the first time, the pro- 
ceedings were published in the Lecture Notes in Computer Science series of 
Springer- Verlag. 

SIROCCO has always encouraged high-quality research focused on the study 
of those factors which are significant for the computability and the communica- 
tion complexity of problems, and on the interplay between structure, knowledge, 
and complexity. It covers topics as distributed computing, mobile computing, 
optical computing, parallel computing, communication complexity, information 
dissemination, routing protocols, distributed data-structures, models of commu- 
nication, network topologies, high-speed interconnection networks, wireless net- 
works, sense of direction, structural properties, and topological awareness. The 
56 contributions submitted to this year’s SIROCCO were subject to a thorough 
refereeing process and 26 high quality submissions were selected for publication. 
We thank the Program Committee members for their profound and careful work. 
Our gratitude extends to the numerous subreferees for their valuable refereeing. 
We also acknowledge the effort of all authors who submitted their contributions. 

We thank the invited speakers at this colloquium, Paul Spirakis (Patras) 
and Shmuel Zaks (Haifa) for accepting our invitation to share their insights on 
new developments in their areas of interest. Paul Spirakis delivered a talk about 
‘'^Algorithmic aspects in congestion games'' and Shmuel Zaks presented “Results 
and research directions in ATM and optical network^'. 

We would like to express our sincere gratitude to the conference chair David 
Peleg (Rehovot) for his enthusiasm and invaluable consultations, and to the 
organizing team chaired by Dana Pardubska and Imrich Vrt’o. 

Our deepest respect belongs to our late friend and colleague Peter Ruzicka 
who started the preparation of SIROCCO 2004. 



June 2004 



Rastislav Kralovic 
Ondrej Sykora 
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Abstract. We consider the traffic grooming problem in passive WDM 
star networks. Traffic grooming is concerned with the development of 
techniques for combining low speed traffic components onto high speed 
channels in order to minimize network cost. We first prove that the traf- 
fic grooming problem in star networks is NP-hard for a more restricted 
case than the one considered in [2] . Then, we propose a polynomial time 
algorithm for the special case where there are two wavelengths per fiber 
using matching techniques. Purthermore, we propose two reductions of 
our problem to two combinatorial optimization problems, the constrained 
multiset multicover problem [3], and the demand matching problem [4] al- 
lowing us to obtain a polynomial time H 2 C (resp. 2 -1- |) approximation 
algorithm for the minimization (resp. maximization) version of the prob- 
lem, where C is the capacity of each wavelength. 

Keywords: star, traffic grooming, WDM network, approximation, algo- 
rithm. 



1 Introduction 

Recently, in order to utilize bandwidth more effectively, new models appeared 
allowing several independent traffic streams to share the bandwidth of a light- 
path. It is in general impossible to setup lightpaths between every pair of edge 
routers and thus it is natural to consider that traffic is electronically switched 
(groomed) onto new lightpaths toward the destination node. The introduction of 
electronic switching increases the degree of connectivity among the edge routers 
while at the same time it may reduce significantly wavelength requirements for 
a given traffic demand. The drawback of this approach is that the introduction 
of expensive active components (optical transceivers and electronic switches) 
may increase the cost of the network. These observations motivated R. Dutta 
and G.N. Rouskas [2] to study the traffic grooming problem that we consider in 
this paper in order to find a tradeoff between the cost of the network and its 
performance. 

We focus on star networks composed by a set of transmitters, a set of receivers 
and a hub, and the goal is to minimize the total amount of electronic switching. 
This cost function measures exactly the amount of electronic switching inside 
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the network (but it only indirectly captures the transceiver cost). Our interest 
to star networks besides their simplicity, which allows us to provide the first 
approximation algorithms with performance guarantee for this variant of the 
traffic grooming problem, is also motivated by their use in the interconnection 
of LANs or MANs with a wide area backbone. 



1.1 Problem Definition 

We consider a network in the form of a star with + 1 nodes. There is a single 
hub node which is connected to every other node by a physical link. All the 
nodes, except the hub, are divided into two groups Vi and V 2 : the nodes in Vi 
are the transmitters and the nodes in V 2 are the receivers. The hub is numbered 
0 and the N other nodes are numbered from 1 to N in some arbitrary order. 
Each physical link consists of a fiber, and each fiber can carry W wavelengths. 
Each wavelength has a capacity C, expressed in units of some arbitrary rate. 
We represent a traffic pattern by a demand matrix T = \tij\, where integer tij 
denotes the number of traffic streams (each unit demand) from node z G Vi to 
node j G V 2 • We do not allow the traffic demands to be greater than the capacity 
of a lightpath, i.e. for all (z, j),0 < < C. 

The hub has both optical and electronic switching capabilities: it let some 
lightpaths pass through transparently, while it may terminate some others. Traf- 
fic on terminated lightpaths is electronically switched (groomed) onto a new 
lightpath towards the destination node. A traffic demand (or request) Uj must 
have its own wavelength from z to the hub and from the hub to j to be optically 
routed, whereas it can share a wavelength with some other traffic demands if it 
is electronically switched. The goal we consider in this paper is to minimize the 
total amount of electronic switching at the hub. This problem is often called the 
traffic grooming problem. 

R. Dutta and G.N. Rouskas considered in [2] the traffic grooming problem in 
several network topologies, including a star network. However there are differ- 
ences between their problem and ours: in [2], each node of the network, including 
the hub, can be a transmitter and a receiver, and traffic demands between two 
nodes are allowed to be greater than the capacity of a wavelength (i.e. it is pos- 
sible that tij > C for some i,j). To distinguish the two problems, we will call 
their problem the traffic grooming problem in an active star, and our problem 
the traffic grooming problem in a passive star (see Section 4 for an integer linear 
programming formulation of the problem) . Once we know which traffic demands 
are optically routed, the wavelength assignment problem is easy in the case of a 
passive star network. 

There are in fact two versions of the traffic grooming problem: either we 
want to minimize the total amount of electronic switching at the hub (this is 
the minimization version), or we want to maximize the total amount of traffic 
which is optically routed (this is the maximization version). These two versions 
are equivalent (i.e. an optimal solution for one is also an optimal solution for 
the other one) because the optimal solution of the maximization problem is 
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equal to the sum of all the traffic demands, minus the optimal solution of the 
minimization problem. 

Our results are as follows. First, we show in Section 2 that the traffic groom- 
ing problem in a passive star is NP-Complete, in both the minimization and the 
maximization versions of the problem. Then we show in Section 3 that these 
problems are polynomially solvable if there are only two wavelengths per fiber 
{W = 2): we give an algorithm which gives an optimal solution. In Section 4, we 
show that we cannot deduce a constant approximation guarantee of the maxi- 
mization (resp. minimization) version from a constant approximation guarantee 
of the minimization (resp. maximization) version of the problem, and we give 
two approximation algorithms. The first one concerns the minimization ver- 
sion: we transform our problem in a constrained multiset multicover problem 
[3], and we get an approximation guarantee of H 2 C- The second approximation 
algorithm concerns the maximization version: we transform our problem in a 
demand matching problem in a bipartite graph [4], and we obtain an approxi- 
mation guarantee of {2 + |). We conclude the paper in Section 5. 

2 Hardness Results 

Let us show in this section that the decision version of the grooming problem in 
a passive star is NP-Complete. 

In order to do this proof, we were inspired by the proof of R. Dutta and G.N. 
Rouskas in [2]: in this paper they showed that their traffic grooming problem 
is NP-complete. They reduced the decision version of the Knapsack problem to 
their problem. We do the same reduction, replacing traffic demands Uj greater 
than C by several traffic demands of the same weight from i, or to j. They 
also used traffic demands to the hub (or from the hub) . We replace these traffic 
demands by traffic demands to some new nodes (or from some new nodes) and 
we force these traffic demands to be switched electronically at the hub. 

We reduce the decision version of the Knapsack problem [1] to our grooming 
problem: let Q € is there a solution of our grooming problem in which the 
amount of optically routed traffic is greater or equal to Q? An instance of the 
Knapsack problem is given by a finite set U of cardinality n, for each element 
Ui G U a, weight Wi G and a value Vi G ,\!i G {1,2, ...,n}, a target weight 
B G Z^ , and a target value K G Z^ . The problem asks whether there exists a 
binary vector X = {xi, X 2 , ■■■, x„} such that X)r=i — 

K. Given such an instance, we construct a star network using the following 
transformation: W = n, C = maxi{wi -I- Ui) -I- 1, Q = K + — Wi — Vi) 

and the traffic matrix is represented on the Figure 1. In this figure the nodes 
are the nodes of the star network (the hub is not represented), and the links 
represent the traffic demands. Traffic demands equal to 0 are not represented, 
and the value on the link from a node a to a node b is ta^b- Nodes from n -I- 1 to 
n -I- 10 represent each one a node of the network, but nodes ia,ja, ka, la, rna,Pa 
and Qa represent each one several nodes: 
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For the nodes ia, a ranges from 1 to n (i.e. ia represents the nodes ii, i 2 , in)', 

for the nodes ja, a ranges from 1 to 
for the nodes ka, a ranges from 1 to 
for the nodes I a, a ranges from 1 to 

for the nodes nia, a ranges from 1 to mod (c i)) j, 

for the nodes Pa, a ranges from 1 to 

r 2.1 1 r 1 2 . I nC—n{(n — 2)C mod (C— 1)) i 

for the nodes qa, a ranges from 1 to [ 

and for the nodes r„, a ranges from 1 to J . 



C-i 




Ta 

n + 10 



Pa 

n + 7 



qa 

n + 8 



ka 



n + 4 



Fig. 1. Illustration of the traffic matrix. Transmitters are on the left and receivers on 
the right 



Lemma 1. Let a be a transmitter and b a reeeiver. It is not possible to have a 
lightpath from a to b, if (a, b) ^ (n+ 1, i^) or (a, 6) yf (n + 2, ia) ■ 
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Proof. Let us show that each traffic demand different from 0 between each couple 
(a, b) of nodes in Vi x V 2 ((a, b) (n+ 1, ia) and (n + 2, ia)), cannot be optically 
routed. In order to show that it is not possible to route ta^b optically, we will 
see that either the sum of the traffic streams from a, or the sum of the traffic 
streams to b, is equal to nC, and that fa, 6 is smaller than C. 

— Vx G V 2 ,tn+ 9 ,x cannot be optically routed. Indeed: 

Sxevb ^"+9, a; = S/3 ^n+9,rp + tn+9,n+10 P S/3 ^n+9,i/3 

= (C* - 1) + (nC - X;Li Wk) mod (C - I) 

+ SLi^fc 
= nC 

and tn+9,rQ, ^ C-} ^n+9,n+10 ^ ^ i ^n-\-9,ia ^ ^ ■ 

— Vx G cannot be optically routed. Indeed: 

SxeVG “ S/3 + S/3 + '^ja,n+7 

= n{C - 1) + (C - 1) + n mod (C - I) 

= nC 

and ^ Cj ^ja.P /3 ^ ^ 

— \/x G V 2 ,tn+ 3 ,x cannot be optically routed. Indeed: 

SxeVb ^"+3, a; = S/3 ^n+3,ip + S/3 ^"+3,9/3 fn+3,n+8 

= n((n-2)Cmod (C- I)) + (C-I) 

+ {nC — n((n — 2)C' mod {C — I))) mod {C — I) 

= nC 

and t^_|_ 3 < d7, < d7, t ^_|_3 < C. 

— \/x G V\,tx,ka cannot be optically routed. Indeed: 

SxeVi ^a:,fcc = tn+l.ka S/3^G>fcc« + ^n+5,feo 

= (c - 1) + pg-(g-i) j (c - I) + (nC - (C - I)) mod (C - I) 
= nC 

and tn-\-l,kc^ ^ ^1(3, ka ^ ^7 ^n+5,fca: ^ 

— Vx € Vi,ta;,n +4 cannot be optically routed. Indeed: 

SxeVi ^x,n+i = S/3 ^m^,n+4 H“ ^n+6,n+4 H“ ^n+l,n+4 

^ mod (C-D) ^ - ((ELl 

B)) mod (C-I))J) mod (C'-l) + (X:Si(wfc-B)) mod (C-I) 
= nC 

and < C, tm/^ ,n+4: ^ C, ^ C. D 

Lemma 2. Let a G {I,...,n}. Traffic demands tn+i,ia kind tn+ 2 ,ia cannot be 
optically routed simultaneously. 

Proof. The node ia receives from the hub a total traffic equal to: + 

S/3^i/3,G + C+3,ic = cui^ + L ^"c-|^ J(C - 1) + {{n - 2)C) mod (C - 1) = 
(n — 2)C + Wi^ > (n — 2)C 

Since W = n, there is at most one wavelength left to have a lightpath to the 
node ia. □ 
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Lemma 3. Let a G {1, n}. It is possible to have a lightpath from n+1 to i^, 
or from n + 2 to ia- 

Proof. — Let us show that it is possible to have a lightpath from n + 1 to 
J2xeVi,x^n+i = {n- 2)C + Wi^ + {C - Wi^ - ViJ = {n - l)^ - Vi^ < 
{n-l)C 

and, since 3ia G {1, B > Wi^ (otherwise the instance of the Knap- 
sack problem would be trivial) and C = maxi^{vi^ + -I- 1, we have 

Y.x(^v- 2 ,x^tc. ^"-1-1.^ = Ylk=ii'^k - B) < {n- l)C. So there is enough band- 
width to have a lightpath from n -I- 1 to ia- 
— Let us show that it is possible to have a lightpath from n -I- 2 to to,: 
'Zx&v^,x^u +2 = (n - 2)C -k Wi^ +{C-WiJ = {n- l)C < (n - l)^ 

and ^'n+‘^,x = 0 < ~ ^)C. So there is enough bandwidth to 

have a lightpath from n -I- 2 to Iq,. □ 

Since tn-i-i.ic tn+2,ia G {!) Etre the only traffic demands which 

can be optically routed (Lemma 1) and since, for each a G {1, ..., n} it is possible 
to have a lightpath from n -I- 1 to Zq,, or from n -I- 2 to ia, (Lemma 3) but not 
both (Lemma 2), we need only to consider solutions in which there is a lightpath 
from exactly one of the nodes rz -I- 1, n -I- 2, to each node ia to determine the 
satisfiability of the instance. 

Let X denote a candidate solution of the Knapsack instance. Consider the 
solution of the grooming problem in which X (respectively, X) represents the 
indicator vector of the lightpaths formed from node rz -I- 1 (resp., rz -I- 2). Nodes 
ia are numbered from 1 to rz: let a G {1, ...,rz}, we have ia = ce. Applying the 
transformation to the satisfiability criteria of Knapsack, we obtain: 

1 ^n+l,z) ^n+l,z) (^n+l,n+4 ’^2/3 ^n+l,fc/ 3 ) 

^^ 2— 1 (^n+l,n+4 + ’^ptn+l.kp) ^ {n 

This inequality means that the amount of electronically routed traffic de- 
mands (the left hand side of the inequality) has to be smaller than or equal to 
the capacity of a link, C, times the number of links available (i.e. rz minus the 
number of traffic demands which are electronically routed). 

Z)”=l ^ ^ 

^ 1 ^i{tn-\-l^i tn-\-2.i) ^ Q 1 ln+2.i 

itn+l,i “k Xitn-\-2.i) P Q 

This inequality means that the total amount of optically routed traffic has 
to be greater than or equal to Q. 

Therefore, a vector X either satisfies both the Knapsack and the grooming 
instance, or fails to satisfy both. Hence, the grooming instance is satisfiable if 
and only if the Knapsack instance is. 

Theorem 1. The decision versions of the minimization and the maximization 
versions of the grooming problem in a passive star are NP-Complete. 
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Proof. We already proved that the decision version of the minimization version 
of the grooming problem in a passive star is NP-Complete. Since we can easily 
switch from a version to the other one {OPTmax = {J2i(^Vi,j&V2 “ OPT^in), 

the decision version of the maximization version of the grooming problem in a 
passive star is also NP-Complete. □ 

3 Polynomial Time Algorithm for W = 2 

Let us show that the grooming problem in a passive star is polynomially solvable 
when the number of wavelengths on each fiber, W, is equal to 2. We will give a 
polynomial time algorithm which gives an optimal solution of this problem. 

First of all, let us remark that in each row (or column) of T where there is 
at least three values different from 0, we can at most route one traffic demand 
optically, because each lightpath which is not electronically switched at the hub 
needs a wavelength for him only, and we have only two wavelengths per fiber. 
On the contrary, when there is in a row (or column) only two values different 
from 0, it may be possible to route optically both. We transform the matrix T in 
a matrix T' in which it is possible to route optically at most one traffic demand 
for each row, and one traffic demand for each column: if a row of T has two 
and only two values tij and tif different from 0, we create two rows il and i2 
in T' such that = tij, = ^ij' the other values in these rows are 0. 
Similarly, if a column of T has two and only two values and different from 

0, we create two columns jl and j2 in T' such that t'i = tij, t ' ^2 = 

the other values in these columns are 0. In this way, there is at most one request 
per row and one request per column which can be optically routed. 

Let us now transform our traffic matrix T' in another matrix M = [m^], in 
which we will look for a maximum weighted matching. In order to do that, we 
will apply the following rule: if a traffic demand fb cannot be optically routed 
(i-e- llk^jt'ik > C or Y.k^^t'kj > then my = -oo. Otherwise, my = t'y. 
Since there is in M at most one request per row and one request per column 
which can be optically routed, and since a traffic demand my is different from 
— oo if and only if it is allowed to be optically routed, the result of a maximum 
weighted matching in the bipartite graph whose adjacent matrix is M, is an 
optimal result for the traffic grooming problem whose traffic matrix is T. 

Theorem 2. The minimization and the maximization versions of the traffic 
grooming problem in a passive star are polynomially solvable if the number of 
wavelengths per fiber is equal to two. 

Proof. The optimal solution of the minimization version and the optimal solu- 
tion of the maximization version are the same, and the algorithm above solves 
polynomially these problems. □ 

4 Approximation Algorithms 

Theorem 3. It is not possible to deduce a constant approximation guarantee 
for the maximization (resp. minimization) version of the traffic grooming prob- 
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lem in a passive star network from a constant approximation algorithm for the 
minimization (resp. maximization) version. 



Proof. Let OPT^ax be the cost of an optimal solution of the maximization 
version (i.e. the maximum amount of traffic which can be optically routed) and 
OPTjnin the cost of an optimal solution of the minimization version (i.e. the 
minimum amount of traffic which is electronically switched at the hub in a 
feasible solution) . Let S' be a solution of the traffic grooming problem in a passive 
star. Let denote Cmax{S) the cost of the maximization version (i.e. Cmax{S) is 
the amount of traffic which is optically routed in the solution S) and Cmin{S) 
the cost of the minimization version (i.e. Cmin{S) is the amount of traffic which 
is electronically switched at the hub in the solution S). 



Let £i and £2 be two real numbers such that 0 < £1 < 1 and 0 < £ 2 . Let 
us show that it is not possible to deduce a (1 — £1 (-approximation guarantee 
for the maximization version from a (1 -I- £2 (-approximation algorithm for the 
minimization version: Consider an instance / of the problem, where we can 
at most route only one traffic stream optically {OPTmax = 1 and OPTmin = 
ieVa ~ !)• Consider a solution S' of / where all the traffic demands are 
electronically switched at the hub {cmax(S) = 0 and Cmm(S) = J 2 teVijeV 2 ^b)- 



If E*eyi.ieF 2 ^b is large enough, we have 



iGViJ€V2 



but there is no £1 < 1 such that ^ is greater than or equal to 1 — £ 1 . 

Let us show that it is possible to have an instance such that j^y^ Uj 

is as big as we wish and where we can at most route one traffic stream opti- 
cally: consider the instance where we have 2W transmitters, 2 receivers, and 
the traffic matrix [tij] is such that ti^\ = 1; Vf € {2,3, ...,2W}, ti^i = 0; and 
Vi € {1, ...,2W}, ti 2 = y. The only traffic demand which can be optically 
routed is because the second receiver receives WC traffic streams, and each 
traffic demand is different from C. 

Similarly, let us show that it is not possible to deduce a (l-|-£ 2 )-approximation 
guarantee for the minimization version from a (1 — £i)-approximation algorithm 
for the maximization version: Consider an instance / of the problem, where 
all the traffic demands can be optically routed {OPTmax = jGV^ ^b 

OPTmin = 0). It is trivial that such an instance exists. Consider a solution S 
of / where all the traffic streams, except two, are optically routed {cmax{S) = 
iJ2iGVi,jGV2 fb) “ 2 and Cmin{S) = 2). If J2t(^Vi,j&V2 % I® enough, we have 
cmaxl-g) _ (S.svi.jgva bj) ^ 1— £i but there is no £2 > 0 such that — - 



■€ViJeV2 



■yni < 1 + £2 



OPTrr. 



E. 



evi.jeva '-'3 



OPT„, 



is smaller than or equal to 1 -I- £ 2 . 



4.1 Approximation Algorithm for the Minimization Version 

Let us give an integer programming formulation of the minimization version of 
the traffic grooming problem in a passive star. Let denote Xy G {0, 1} a variable 
which indicates whether the traffic demand is optically routed (xy = 0) or 




Traffic Grooming in a Passive Star WDM Network 



9 



electronically switched at the hub {xij = 1). The objective is to: 



Minimize E Xijtij (1) 

*eVi,iGV2 

We have two types of constraints: 
constraints on the frequencies: 

>|y2|-Vb (2) 

3 &V 2 

yj €V2,J2 x,j>\Vi\-W (3) 

iGVi 

constraints on the traffic: 

Vz G Fi, ^ (1 - x,j){C - Uj) <WC-J2 (4) 

ieiG ieVa 

Vj G V 2 , E “ a^b)(C - Uj) < WC - ^ tij (5) 

iGVi ieVi 



Inequalities (2) (resp.(3)) mean that at most W traffic demands per trans- 
mitter (resp. receiver) can be optically routed, because we need one wavelength 
for each traffic demand optically routed. Inequalities (4) and (5) mean that the 
unused space (C — tij) left when tij is optically routed, has to be smaller than the 
free space available (i.e. the total amount of bandwidth minus the total amount 
of traffic demands) . It is also equivalent to say that the amount of electronically 
routed traffic demands has to be smaller than the capacity of a link, C, times 
the number of links available (i.e. W minus the number of traffic demands which 
are optically routed). 

Constrains (4) and (5) are equivalent to: 

Vz G Cl, ^ Xi,{C - Uj) > C(\V2\ - W) (6) 

3 &V 2 

Vj G M 2 , ^ Xi, (C - ti,) > C(|Mi I - W) (7) 

*eVi 

Note that the constraints on the traffic imply the constraints on the frequen- 
cies: if we divide by C the constraints on the frequencies we have: 

VzGMi,^a:y(l-^)>|M2|-Vb (8) 

ieVG 

VjGM2,E^b(l-^)>|4V|-IM 

ieVi 



(9) 
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Since Uj < C, these last inequalities imply the constraints on the frequencies 
(2) and (3). 

So we can formulate our problem in the following way: 

Minimize E 

leVi.ieEi 

subject to: 

Vz G Fi, ^ Xi,{C - Uj) > C(\V 2 \ - W) (11) 

ieiG 

Vj G 1/2, ^ XijiC - Uj) > C(|l/i| - W) (12) 

leVi 

VzGyi,jGl/2,a:y G{0,1} (13) 

Theorem 4. There exists a polynomial time H 2 C -appToximation algorithm for 
the minimization version of the traffic grooming problem in a passive star. 

Proof. We will transform the minimization version of the traffic grooming prob- 
lem in a passive star into a constrained multiset multicover problem. Given a 
universal set a collection of subsets oiU, T = {S\, 82 ^ .■■, Sk}, and a cost func- 
tion c : T ^ Q/, the set cover problem asks for a minimum cost sub-collection 
C £ T that covers all the elements of U (i.e. IJsec S = U). The multiset multi- 
cover is a natural generalization of the set cover problem: in this problem each 
element e occurs in a multiset S with arbitrary multiplicity denoted m{S,e), 
and each element e has an integer coverage requirement Vg, which specifies how 
many times e has to be covered. In the constrained multiset multicover prob- 
lem, each subset S' G T is chosen at most once. Thus the integer program is: 
Minimize c(S)xs subject to '^m{S,e)xs > rg and xs G {0, 1}. 

Let us show the transformation of the traffic grooming problem into the 
constrained multiset multicover problem. This transformation comes directly 
from the integer programming formulation of the problem: for each request from 
the transmitter z G Vi, denoted by Ci, to the receiver j G V 2 , denoted by rj, we 
create the subset S^ which contains {C — tij) times and (C — tij) times Vj. 
The cost of this subset is c{Sij) = tij. The covering requirement of each element 
e G V\ is Te = (11(11/21 — W), and the covering requirement of each element e G V 2 
is rg = C(\Vi\-W). 

S. Rajagopalan and V. Vazirani give in [3] a greedy approximation algorithm 
for the constrained multiset multicover problem. This algorithm consists in it- 
eratively picking the most cost-effective set from T and removing this set from 
T. The cost-effectiveness of a set S is the average cost at which it covers new 
elements, i.e. the cost of S divided by the number of its elements which are not 
yet covered. They proved that this algorithm has an approximation guarantee 
of Hk, the fc-th harmonic number (i.e. Hk = 1 -|- i -I-), where k is the size 

of the largest multiset in the given instance. In our case, the size of a multiset 
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Sij is 2C — 2tij , which is smaller than 2C. So we obtain a solution of the traffic 
grooming problem which has an approximation guarantee of i? 2 C ^ log{2C) + 1. 

□ 



4.2 Approximation Algorithm for the Maximization Version 

The maximization version of the traffic grooming problem in a passive star is 
the following one: 



Maximize E VijUj 

ieVi,ieV2 



(14) 



subject to: 

Vt G Vi, ^ 2/y (C - Uj) <CW~Y, (15) 

ieiG ieiG 

Vj G V 2 , ^ ViiiC - tij) < CW - ^ Uj (16) 

ieVi iGVi 

VzG Vi,j G V2,y.i G {0,1} (17) 

Here jjij indicates whether tij is optically routed {jjij = 1) or electronically 
switched at the hub (y^ = 0). This integer programming formulation of the 
problem is obtained by replacing Xij in the integer programming formulation of 
the minimization version by 1 — yij . 



Theorem 5. There exists a polynomial time (2 + -approximation algorithm 
for the maximization version of the traffic grooming problem in a passive star. 



Proof. Let us now transform this problem into a demand matching problem [4]. 
The demand matching problem is the following one: take a graph G = (V, E) and 
let each node v GV have an integral capacity, denoted by by. Let each edge e = 
(u,v) G E have an integral demand, denoted by de. In addition, associated with 
each edge e G if is a profit, denoted by Pe- A demand matching is a subset MCE 
such that X)eGi 5 (v)nM for each node v. Here 6{v) denotes the set of edges 

of G incident to v. The demand matching problem is to find a demand matching 
which maximizes X)eeMPe- Thus the integer program is: Maximize X)ee£; 
subject to: Vv G V, ^ee<5(v) G E,Xe G {0, de}. 

B. Shepherd and A. Vetta showed in [4] that a randomized algorithm provides 
a factor (2 + |)-approximation guarantee for the demand matching problem in 
bipartite graphs. 

Let us show the transformation of the traffic grooming problem in a passive 
star into the demand matching problem: Ve G {Vi x V 2 ), Xe = ye{G — te) , Pe = 
te , de = C — te. Vu G V} , by = GW — tyj and Vu G V 2 , by = CW — 

Since there is a factor (2+|)-approximation algorithm for the demand match- 
ing problem, there is also a factor {2-\- |)-approximation algorithm for the traffic 
grooming problem in a passive star. □ 
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5 Conclusion 

We showed in this paper that the traffic grooming problem in a passive star 
is NP-Complete, in both the minimization and the maximization versions of 
the problem. We showed that these problems are polynomially solvable if there 
are two wavelengths per fiber: we gave an algorithm which gives an optimal 
solution. We showed that we cannot deduce a constant approximation guarantee 
of the maximization (resp. minimization) version from a constant approximation 
guarantee of the minimization (resp. maximization) version of the problem. We 
gave two approximation algorithms and we obtained an approximation guarantee 
of H 2 C for the minimization version and an approximation guarantee of (2+|) for 
the maximization version. Since the solutions returned by these algorithms are 
all solutions of the maximization version as well as solutions of the minimization 
version of the problem, it would be interesting to program these algorithms and 
compare their results. 
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Abstract. In this paper we consider all-optical networks in which a 
service provider has to satisfy a given set of communication requests. 
Each request is charged a cost depending on its wavelength and on the 
wavelengths of the other requests met along its path in the network. 
Under the assumption that each request is issued by a selfish agent, we 
seek for payment strategies which can guarantee the existence of a pure 
Nash equilibrium, that is an assignment of paths to the requests so that 
no request can lower its cost by choosing a different path in the network. 
For such strategies, we bound the loss of performance of the network 
(price of anarchy) by comparing the number of wavelengths used by the 
worst pure Nash equilibrium with that of a centralized optimal solution. 



1 Introduction 

In the last years optics has emerged as a key technology in communication net- 
works due to the promise of data transmission rates several orders of magnitudes 
higher than conventional technologies [3, 11, 12, 19, 20]. 

All-optical communications networks exploit photonic technologies for the 
implementation of both switching and transmission functions [10]. These systems 
provide all source-destination pairs with end-to-end transparent channels that 
are identified through a wavelength and a physical path. Maintaining the signal 
in optical form allows for high data transmission rates in these networks since 
there is no conversion to and from the electronic form. 

The high bandwidth of the optical fiber is utilized through the wavelength- 
division multiplexing (WDM) [4] approach in which two signals connecting 
source-destination pairs may share a link, provided they are transmitted on 
carriers having different wavelengths (or colors) of light, see [1, 9] for a survey of 
the main related results. 

We consider all-optical networks in which a service provider has to satisfy a 
given set of communication patterns, where a pattern can be seen as a request 
for a connection made by a source-destination pair. Each pair is assigned a 
path on the network and is charged a cost depending on its wavelength and on 
the wavelengths of the other requests met along its path in the network. We 
assume that each request is issued by a selfish agent who is interested only in 
the minimization of his own cost. Under this assumption any request is willing 
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to be rerouted each time it may be served by a cheaper path in the network 
and the evolution of the network can be modelled as a multi-player game. A 
routing strategy, that is an assignment of paths to the requests, in which no 
request can lower its cost by choosing a different path is said to be a pure 
Nash equilibrium. Several games [5,8,15,21,23] have been shown to possess 
pure Nash equilibria either by using an inductive argument or by defining a 
suitable potential function decreasing each time a player performs an improving 
defection. The latter approach, in particular, is useful to prove a stronger result, 
that is the convergence of the game, independently from its starting state, to a 
pure Nash equilibrium, see [6] for an interesting discussion. 

Clearly, because of the lack of cooperation among the players, Nash equilibria 
are known not to always optimize the overall performance, with the Prisoner’s 
Dilemma [17, 18] being the best-known example. To this aim, in [13] such a 
loss of performance is defined as the ratio between the cost of the worst Nash 
equilibrium and that of an optimal centralized solution and is called price of 
anarchy or coordination ratio. Bounding the price of anarchy of selfish routing 
in different models is now arising as one of the most interesting research areas 
lying on the boundary between Computer Science and Game Theory, see for 
example [14,22]. 

In this paper we present four reasonable paying strategies that a service 
provider can adopt in all-optical networks in order to keep the overall perfor- 
mances of the network (modelled as the number of used wavelengths) acceptable, 
assuming that users are selfish agents who may not cooperate. We determine 
which payment strategy induces a game converging to a pure Nash equilibrium. 
For these such strategies, however, we show instances for which the price of 
anarchy is arbitrarily high. 

The paper is organized as follows. In the next section we give the basic 
definitions and notation used throughout the paper, in section 3 we present 
some results for the proposed cost measures and finally, in the last section, we 
address some interesting open problems. 



2 Definitions and Notation 

We model a network topology by an undirected graph G = (V, E) where V 
represents the set of sites and each undirected edge {x,y} € E between two 
nodes x,y represents a link between the sites. 

In all-optical communication networks, sites (nodes) communicate via send- 
ing signals of different wavelengths (colors) along optical fiber lines (undirected 
edges). A point-to-point communication requires to establish a uniquely colored 
path between the two nodes whose color is different from the colors of all the 
other paths sharing one of its edges. 

In the sequel we will use the following notation: 

— P{x, y) denotes a path in G from the node x to y, that is a set of consecutive 
edges beginning in x and ending in y. 
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— A request is a source-destination pair of nodes {x, y) in G (corresponding to 
a communication between x and y) . 

— An instance / is a collection of requests. Note that a given request {x, y) can 
appear more than once in an instance. 

— A routing R for an instance / in G is a set of paths R = {P{x, y)\{x, y) G /}. 
Let Cfl be the color function associated to the routing R, i.e. given {x, y) G /, 
cn{{x,y)) is the color of the path P{x,y) G R and c{R) is the number of 
colors used by R. 

— The conflict graph associated to a routing R is the undirected graph {R, E) 
with vertex set R and such that two paths of R are adjacent if and only if 
they share an edge of G. 

— The set of available wavelengths (or colors) is PL ={1,2,3,...}. f :W ^ M 
is a pricing function for the colors. Without loss of generality we assume / 
non decreasing and positive. 

— We denote as p a payment function that, given a pricing function on the 
colors / and a routing R, associates a cost p{{x, y)) to any request {x, y) G I. 

Let G be a graph and / an instance. The problem (G,I) asks the determi- 
nation of a routing R for the instance / and assigning each request (x, y) G I a, 
wavelength, so that no two paths of R sharing an edge have the same wavelength. 
If we think of wavelength as colors, the problem (G, I) seeks a routing R and a 
vertex coloring of the conflict graph (i?, E), such that two adjacent vertices are 
colored differently. 

A game with n > 2 players is a finite set of actions Si for each player, and 
a payoff function Ui for each player mapping x . . . x S'„ to the integers. The 
elements of S'! x . . . x S'„ will be called action combinations or states. A pure Nash 
equilibrium is a state s = (si, . . . , s„) such that for each i, Ui{si, . . . , Si, . . . , s„) > 
Mi(si, . . . , s', . . . , Sn) for any s' G Si. In general a game may not have pure Nash 
equilibria, however, Nash [16] proved that if we extend the game to include as 
strategies for i all possible distributions on Si, with the obvious extension of the 
Mi’s to capture expectation, then an equilibrium is guaranteed to exist. 

Consider a graph with node set Si x . . . x Sn and an edge (s, s') whenever 
s and s' differ only in one component, say the i-th, and Mi (s') > Mi(s). We call 
such a graph the Nash dynamics and the edge (e, e') a move of player i. The 
Nash dynamics models the evolution of the game as a sequence of moves. If it 
admits a sink then the game has a pure Nash equilibrium, while if it is acyclic 
then the game is always guaranteed to converge to a pure Nash equilibrium after 
a finite number of moves. 

Given a graph G = (V, E), an instance I, a pricing function / and a payment 
function p we define as {G,I,f,p) the following game. We have r = |/| players, 
where each player i is a request {x, y) G I, the set of actions for player i, Vz = 
l,...,r, is Si = P{x,y) and the utility function is Ui = —p{{x,y)). Let R* 
denote a routing minimizing the number of colors for the problem (G, I) and let 
TZ denote the set of pure Nash equilibria for the game (G, I, f,p), if TZ ^ 0 the 
price of anarchy of the game is defined as p(G, I, f,p) = supR^n c(r») ■ 
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3 Results for Different Payment Functions 

In this section we present different payment functions p and establish which of 
them give raise to a game possessing a pure Nash equilibrium. For such functions 
we also give a lower bound on the corresponding price of anarchy. 

The payment functions we will consider in this section are the following: 

- pi{{x,y)) = f{cii{{x,y))), that is, a request (x,y) pays for the price of the 
color used to communicate. 

- P2{{x,y)) = maXeeK((a;,j/))max(a_{,)g/|e6K((a,6)) /(cfi((a, 6))), that is, a re- 
quest (x, y) pays for the maximum price of all the colors used by any other 
request sharing an edge with the path used by (x,y). 

- P3{{x,y)) = J2eGRi{x,y))^^^{o.,b)Gi\eGR{ia,b)) f{cR{{a,b))), that is, a request 
{x, y) pays for the sum over all the edges belonging to the used path of the 
maximum price of the colors passing through the edge. 

- PA{{x,y)) = Eegfl((x,y)) max(,^fc)gj|,gfl((,,fc)) > that is, a 

request (x, y) pays for the sum over all the edges belonging to the used path 
of the maximum price of the colors passing through the edge, divided by the 
number of request sharing the edge. 



3.1 The Payment Function pi 

The first payment function we consider is, given (x, y) G I and a routing R, 
Pi{{x,y)) = f{cn{{x,y))), that is, a request {x,y) pays for the price of the color 
used to communicate. 

The following theorem shows that for this payment function a Nash equilib- 
rium is always achieved in a polynomial number of moves. 

Theorem 1. The game (G, /, /, pi) converges to a Nash equilibrium in at most 
r^ moves. 

Proof. Since any move of a request (x, y) does not affect the payment of any other 
request, at most r^ moves are possible until a Nash equilibrium is achieved. In 
fact, being the number of needed colors at most equal to r and assuming that 
no request make stupid moves using a color w > r, each request can move at 
most r times. □ 

This payment function, however, in unable to guarantee, in general, good 
performances to the network. Indeed, there exist graphs G and instances I for 
which the price of anarchy can be very high, as shown in the following theorem. 

Theorem 2. For any pricing function f, there exist G and I such that 
p{G,I,f,pi) = r. 

Proof. Consider a ring having r nodes and the instance I = {(ui, ^(^+1)7710^ r)l* € 
{!,..., r}}. In the routing R* using the same edge {vi,V(i+i)rnod r} for each 
request {vi, U(i_|_i)„ 7 od r), we have that cr* = 1, that is only one color is used. On 
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the other hand, the routing R that uses, for each request (uj, U(j_|_i)mod r)> the 
path {Ui, V(,_i)mod r}, • • • , {v(i+ 2 )mod r, V{i+i) 7 nod r}, we have that CR = r. Since 
the routing i? is a Nash equilibrium, the claim follows. □ 

Such a bad result is justified by the following consideration. Given a routing 
R, finding a minimum assignment of wavelengths to the paths in R is equivalent 
to finding a minimum coloring of the conflict graph induced by R. Since the 
problem of coloring a graph G = (V, E) cannot be approximated within \V\^ [2] 
and a Nash equilibrium achieved in polynomial time is an approximated solution 
to graph coloring, being |i?| = |J| = r, the price of anarchy cannot be better 
than r r . 

One of the drawbacks of the payment functions p\ is that each request {x, y) 
is charged a cost depending only on the color it uses, without considering the 
fact that all the other requests sharing an edge with the path used by (x,y) 
are not allowed to use the same color. For such a reason one can expect that 
designing payment functions taking into account all the colors passing through 
a particular path may yield games with a better price of anarchy. 

3.2 The Payment Function p 2 

The second payment function we consider is, given (x, y) € I and a routing 
R, P 2 {{x, y)) = T[^&y^e&R((x,y)) max(a_,,)g j|egfi((a,6)) /(cfi((a, 6))), that is, a request 
(a;, y) pays for the maximum price of all the colors used by any other request 
sharing an edge with the path used by (x,y). 

For the payment function p 2 is also possible to prove the convergence to a 
Nash equilibrium even though the number of moves may be exponential. 

Theorem 3. The game (G,I,f,p 2 ) converges to Nash equilibrium in a finite 
number of moves. 

Proof. Let be the sequence of colors CR{{x,y)), V(a;,y) G I listed in non in- 
creasing order. Any move of a player i, from a routing i? to a routing i?' lowering 
his payment from p to p' , can make the payments of the other players risen to 
reach at most the value p' . Thus, letting (3' be the sequence of colors CR’{{x,y)), 
y{x,y) G I listed in non increasing order, we have that /3' is lexicographically 
smaller than fi. Thus, being the set of the possible sequences finite, an equilib- 
rium must be achieved. □ 

It would be interesting determining a graph G and an instance / creating 
the situation described in the above theorem, thus effectively implying the non 
polynomial time convergence in general. 

However, also for this payment function, we cannot expect in general a low 
price of anarchy. In fact, using the same problem (G, /) defined in the proof of 
Theorem 2, it is possible to prove the following result. 

Theorem 4. For any pricing function f , there exist G and I such that 
p{G,I,f,P 2 ) = r. 
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3.3 The Payment Function pa 

The third payment function we consider is, given (x, y) € I and a routing 
R, P3{{x,y)) = J2eeRax,y))^^Ha,b)ei\eeRaa,b)) f{cR{{a,b))), that is, a request 
(x,y) pays for the sum over all the edges belonging to the used path of the 
maximum price of the colors passing through the edge. 

Adding a summation in the payment function clearly tightens the interaction 
between the requests. If from one side this is expected to improve on the price 
of anarchy, from the other one the game could become more complex. The next 
theorem captures this last situation. 

Theorem 5. For any unbounded pricing function of the colors f the game 
{G,I,f,ps) in general does not converge. 

Proof. Consider the graph G = (V, E) in which V = {a, 5, c, vi, V 2 , ■ . ■ , Vh-i, 
wi, W 2 , ..., Wk-i} and E = {{a, 6}, {b,c}, {a,ui}, {ui,U 2 }, ..., {vh-i,b}, 
{6, wi}, {wi,W 2 }, ..., {wfe_i,c}} and the instance I = {(a, c), (a, 6) , (b,c) } 

n times m times 

as shown in Figurel. The initial routing R assigns the colors {l,2,...,n} to 
the edge {a, 6} for the n requests (a, 6), the colors {l,2,...,m} to the edge 
{6, c} for the m requests (6, c) and the color n + 1 to the path {a,b}, {b,c} for 
the request (a, c). If the conditions outlined below hold, the following cycle will 
occur in the Nash dynamics graph: all the m requests (6, c) move on the path 
containing the “w” nodes (i.e. the path {6, wi}, {wi, W 2 }, • ■ • j {wfc_i, c}) using 
the colors {1, 2, . . . , m}; the requests (a,c) moves on the path containing the 
“v” nodes (i.e. the path {a, ui}, {ui, U 2 }, ■ • ■ j {vh-i,b}, {b, c}) using the color 1; 
the m requests (b,c) move on the path containing the only edge {b,c} using 
the colors {2, 3, . . . , m, m + 1}; the request (a, c) moves on the path {a, b}, {b, c} 
using the color n + 1. The first technical condition is n > m > 1. The condition 
2 implies that a request (a, b) never moves on the path containing the “v” nodes: 
/(n+1) < h-f{l). The condition 3 implies that a request (5, c) moves on the path 
{6, c} if it pays at most for the color m+1: /(m + 1) < k ■ f{l). The condition 
4 ensures that the request (a, c) never moves on the path containing the “w” 
nodes: 2/(n+ 1) < (/i+ fc)/(l). The condition 5 implies that the request (a, c) 
can move on the path containing the “u” nodes only if it pays for the color 1: 
/(n+1) < (/i+l)/(m+l). The condition 6 implies that a request (6, c) moves on 
the path containing the “w” nodes if it pays for the color n + 1 on the path {b,c}: 
k • f{m) < f{n + 1). The condition 7 implies that the request (a, c) moves on the 
path containing the “u” nodes when it can pay for the color 1 and the edge {6, c} 
is free: {h + 1)/(1) < 2/(n + 1). The last condition 8 implies that the request 
(a, c) moves on the path {a, 5}, {b,c} when the requests (b,c) are routed on the 
path {6, c}: 2/(n+ 1) < h- f(l) + f(m+l). All the conditions hold if m is chosen 
such that /(to+1) > 2/(1), n such that /(n+1) > ^ that 

2/(„+l)-/(^+l) ^ ^ ^ ^ fi^ ^ ^ ^ □ 

Even though it may seem that the above theorem proves the non-existence 
of a Nash equilibrium in general, it must be noted that this is not the case. 
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I = {(a,b), 1 

, V n times 

(a,b), J 

(b,c), 1 

, > m times 

(b,c), J 

(a,c)} 



Fig. 1. 



The following solution, in fact, is a Nash equilibrium: the request (a, c) is on the 
path {a, 6},{6, c} with color 1, the n requests (a, 6) are on the edge {a, 6} with 
colors {2 , . . . , n -I- 1} and the m requests (6, c) are on the edge {b, c} with colors 
However, we conjecture the non-existence of a Nash equilibrium 
in the general case. Our opinion is based on the fact that the majority of the 
existence proofs known in the literature exploits the acyclicity of the Nash dy- 
namics graph. Moreover, very often the presence of a cycle in such a graph has 
been used to suitably redefine the game in order to show the non-existence of 
the Nash equilibrium. 



3.4 The Payment Function p4 

The last payment function we consider is, given (x, y) € I and a routing R, 
P4{{x,y)) = EeeR{(x,y)) ^^^{a,b)ei\eeR{{a,b)) | |(a,fc)ejfe?(a,fc))|| > ^^at is, a re- 
quest (x, y) pays for the sum over all the edges belonging to the used path 
of the maximum price of the colors passing through the edge, divided by the 
number of request sharing the edge. 

Theorem 6. For the pricing function of colors f{w) = , the game (G, I, /, 

Pa) in general does not converge. 

Proof. Consider the graph G = (V,E) in which V = {a, 5, c, vi, V2, . ■ . , ^^59, 
wi, W2} and E = {{a,b}, {b,c}, {a,vi}, {vi,V2}, ..., {v 59 ,b}, {b,wi}, {wi,W2}, 
{w2, c}} and the instance I = { (a,b) , (6, c), (a, c)} as shown in Figure2. The ini- 

9 times 

tial routing R assigns the colors {1, 2 , . . . , 9} to the edge {a, b} for the 9 requests 
(a, b), the color 1 to the edge {b, c} for the request (5, c) and the color 10 to the 
path {a, 6}, {b,c} for the request (a, c). The game will be stuck in the following 
cycle: the request (6, c) moves on the path containing the “w” nodes (i.e. the 
path {6, wi}, {wi, W2}, {w2, c}) using the color 1; the request (a, c) moves on the 
path containing the “v” nodes (i.e. the path {a, Ui}, {ui, U2}, . . . , {ug, b}, {6, c}) 
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using the color 1; the request (6, c) moves on the path containing the only edge 
{b,c} using the color 2; the request (a,c) moves on the path {a, 6},{6, c} using 
the color 10. □ 




, > 9 times 

(b,c), 

(a,c)} 



Fig. 2. 



Clearly, the same observations proposed with respect to Theorem 5, hold also 
in this case, even though it would be nice to extend the result to any pricing 
function /. 

4 Conclusions 

In this paper we have presented four reasonable paying strategies that a service 
provider can adopt in all-optical networks in order to keep the overall perfor- 
mances of the network (modelled as the number of used wavelengths) acceptable, 
assuming that users are selfish agents who may not cooperate. Our results are 
essentially negative, since we prove that only two functions guarantee the conver- 
gence of the evolutionary game to a pure Nash equilibrium. Moreover, in both 
cases the price of anarchy can be made arbitrarily high. However, it must be 
noted that a good coordination ratio cannot be achieved if the game converges 
in polynomial time since otherwise this would contradict the inapproximability 
results of the relative graph coloring problem. We leave open the problem of find- 
ing a pricing function yielding a game having a low price of anarchy, in the case 
that the convergence to a Nash equilibrium cannot be achieved in polynomial 
time. Another interesting issue if that of proving the non-existence of a Nash 
equilibrium for the cases in which we prove the non-convergence of the game. 

Considering the fact that all of our results are negative, a reasonable step 
could be that of restricting the graph topology (considering for example only 
rings, trees, tori and meshes) or the communication patterns (broadcasting, gos- 
siping). Finally, our model could be extended to cover scenarios in which the 
network performances are measured considering also different metrics as, for 
example, the hardware costs (see [7]). 
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Abstract. An important technique for discovering routes between two 
nodes in an ad-hoc network involves applying the face routing algorithm 
on a planar spanner of the network. Face routing guarantees message de- 
livery in networks that contains large holes, where greedy algorithms fail. 
Existing techniques for constructing a suitable planar subgraph involve 
local tests that eliminate crossings between existing links by deleting 
some links. They do not test whether the deleted links actually create 
some crossings and some of the links are deleted needlessly. As a result, 
some of the routes found in face routing will have an unnecessarily large 
number of hops from source to destination. We consider a new local test 
for preprocessing a wireless network that produces a planar subgraph. 
The test is relatively simple, requires low overhead and does not eliminate 
existing links unless it is needed to eliminate a crossing, thus reducing 
overhead associated with multiple hops. 



1 Introduction 

An ad-hoc network is a network consisting of transmitters, often called hosts, 
that is established as needed, typically without any assistance from a fixed in- 
frastructure. Using wireless broadcasts, each host can communicate with other 
hosts within its transmission range. Typically, not all hosts are within the trans- 
mission range of each other. Thus, communication between two hosts in the 
network is in general achieved by multi-hop routing along a route where in- 
termediate nodes cooperate by forwarding packets. Examples of such networks 
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include sensor, piconet, bluetooth, and home/office networks, and routes in these 
networks have to be constructed on the fly. 

In this paper we consider networks that have the following properties: 1) All 
hosts know the geometric coordinates {x, y) of their location; 2) All hosts have 
the same transmission range R, i.e. any two hosts at distance d < R are able 
to communicate directly; and 3) Communication links are bidirectional. Ad-hoc 
networks satisfying the conditions above are the most common type of ad-hoc 
networks considered in literature. 

A network can be represented by a geometric undirected graph, G = (V, E), 
with nodes representing the hosts of the network, and an edge (link) connecting 
any pair of nodes that can communicate directly. 

Discovering a route between two nodes in an ad-hoc network is an important 
component of current research. In such systems it is vital that route discovery 
uses only local information and is adaptable to the network connectivity An 
important technique for discovering routes between any two nodes in a ad-hoc 
network without the use of flooding is the face routing [2,7] algorithm The face 
routing algorithm succeeds in discovering a route in a networ providing that the 
underlying graph is planar. Since in practice, th original ad-hoc wireless network 
is never planar with many links crossin each other, before using the face routing 
we need to extract from th original network, using a local method, a planar 
connected network spanning the entire underlying network. 

There are two important goals we should be concerned with in our reduction 
from the original wireless network to the geometric planar graph. The first is to 
keep long links and is required so that we can prevent an unnecessary large 
number of hops from source to destination that require extra processing at the 
nodes and may cause failures. Secondly, we must eliminate crossings of links. 
This is required in order to create a planar underlying graph. From a practical 
point of view, the method employed must be efficient and be based on local tests 
and the resulting graph must be a connected planar spanning subgraph of the 
original network. 



1.1 Results of the Paper 

In this paper we consider a method for reducing the overhead of an unnecessarily 
large number of hops from source to destination. 

The Morelia test is a new local test for preprocessing a wireless network that 
produces a planar spanning subgraph of the original wireless network on which 
we can apply face routing. The Morelia test is a generalization of the Gabriel 
test and is relatively simple, requires low overhead and does not unnecessarily 
eliminate existing links if they do not create any crossing, thus reducing the 
overhead associated with multiple hops. The resulting graph is planar and is a 
supergraph of the Gabriel graph [5] . 

In addition to the theoretical justifications of the Morelia test presented in 
this paper, we conducted simulations of our algorithms on randomly generated 
networks so that we can quantify the improvements given by the new method. 
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Fig. 1. Left: The RNG test for producing a planar spanner. Right: Eliminating an 
unnecessary link (dashed line AB) with Gabriel Test 



2 Tests for Reduction to Planarity 



Here we discuss existing tests for planar reduction in wireless networks. In par- 
ticular we consider the Relative Neighbourhood Graph and the Gabriel Graph. 

A planar spanner of a network can be obtained by applying the Relative 
Neighbourhood Graph (RNG) [10] test to every pair of nodes which are within 
the transmission range of each other. Let A, B be two nodes whose distance is 
at most the transmission range R of the network. Gonsider the region delimited 
by the intersection of the circles centered at A and B, respectively, where the 
radius of the circles is R, the power of the stations at A and B, see Figure 1 
(left). If there is no node in the region then the link between A and B is kept. If 
however there is a node C in the region depicted in Figure 1 (left), then nodes A 
and B remove their direct link. In particular, when A (respectively, B) is queried 
on routing data to B (respectively. A) the routing table at A (respectively, B) 
forwards the data through C (or some other similar node if more than one node 
is in the specified region.) The RNG test suffers from the multiple hop effect 
because the elimination of crossings is done by elimination of longer links. 

One of the most important tests for eliminating crossings in a wireless net- 
work is called the Gabriel test [5] which, similarly to the RNG test is applied to 
every link of the network. The main difference between the Gabriel test and the 
RNG test is the smaller size of the region considered for a link elimination. 

Let A, B be two nodes whose distance is less than the transmission range 
R of the network. In the Gabriel test, if there is no node in the circle with 
diameter AB then the link between A and B is kept. If however there is a node 
C in the circle with diameter AB, as depicted in Figure 1 (right), then nodes 
A and B remove their direct link. In particular, when A (respectively, B) is 
queried on routing data to B (respectively. A) A (respectively, B) forwards the 
data through C (or some other similar node if more than one node is in the 
circle with diameter AB). The advantage of doing this rerouting of data is that 
the resulting graph is a planar spanner on which we can apply the face routing 
algorithm for discovering a route from source to destination. 

However, the test merely shrinks the “test region” and creates a planar span- 
ner that keeps some of the links that would be eliminated by the RNG test. 
However, like the RNG test, it does not in any way prevent the multiple hop 
effect. 
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Fig. 2. Multiple hop effect when eliminating a link (line segment AB) via the Gabriel 
test 



For example, consider a set of nodes as depicted in the left-hand side of 
Figure 2. All nodes are mutually reachable. However, when we apply the Gabriel 
test the configuration in the right-hand side of Figure 2 results. We can see that 
although nodes A and B could have reached each other directly in a single hop 
instead they must direct their data through a sequence of many hops. 

We note that the multiple hop effect arises when many nodes are clustering 
in regions. We should note that while the multi-hop effect may result in slower 
message delivery, it may have also a positive effect, since it can decrease the 
power consumption. For example, if instead of using a direct link between A and 
B an intermediate node, say C is used (see Figure 1 (right)) which lies in the 
circle with diameter AB then the power consumption decreases from d(A, B)~p 
to d{A, C)~P -I- d{C, B)~P , for some constant 2 < p. However, this is at the cost 
of additional overhead implied by intermediate hopping. 

3 Planarity and the Multiple Hop Effect 

The purpose of this section is to give an algorithm that eliminates crossings 
but at the same time maintains some “long” links between stations whenever 
possible, thus reducing the number of hops in face-routing. We introduce the 
Morelia test. The Morelia test is an extension of the Gabriel test where the 
algorithm checks for the presence of a crossing before eliminating a link. 

3.1 Morelia Test 

As mentioned in the introduction, we are concerned with the problem of routing 
in networks with complex topology and containing many holes. Both the RNG 
and Gabriel tests eliminate some links not because they create crossing of links, 
but merely for the potential of being involved in crossing. As indicated in the 
example above, if the Gabriel or RNG test is applied to a complex network, the 
spanning planar subgraph that is obtained will contain holes of even larger size. 

The Morelia test attempts to preserve links whenever possible and as a con- 
sequence the resulting planar graph (on which face routing is to be applied) will 
most likely keep the contour very similar to the original contour and the holes in 
the network would not grow much. Thus the resulting planar network will have 
smaller diameter and routes from source to destination will require fewer hops. 
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The Morelia test is similar to the Gabriel test in that given two nodes A and B 
it eliminates links based on the inspection of the circle with diameter AB. Unlike 
the Gabriel test it does not necessarily eliminate the direct link AB when it finds 
another node inside the circle with diameter AB. Instead, it verifies whether the 
nodes inside the circle create any crossing of the line AB. If no crossing is created 
the line AB is kept, otherwise it is removed. The verification of the existence of 
crossing is done in most cases by inspecting only the neighborhood of nodes A 
and B at the transmission distance B. In a few cases, the neighborhood of some 
of the nodes in the circle around AB is inspected. In all cases it is a local test 
that computes the neighborhood of nodes at distance at most two hops of each 
node A and B. 

In the Morelia test of a link AB we subdivide the area of the circle with 
diameter AB into four regions, Xi, X2, hi and I2 as in Figure 3 . The areas 
X2 and Y2 are determined by an arc of a circle of radius R through A and B. 
Furthermore, in the testing we use areas X3 and Y3 as indicated in Figure 3 that 
are outside the transmission radius R of the nodes A and B and within distance 
R from the link AB. For each node A let N{A) be the set of nodes Z such that 
d{A,Z) < R. 

The precise specification of the Morelia test applied to a link AB is given 
below (refer to Figure 3 ). 

Morelia Test Rules 

1 . If there is at least one node in Xi U X2 and at least one node in Yi U I2 then 
the link AB is removed. 

2 . If there is at least one node in Xi and no node in X2 U hi U I2 then the 
node A checks whether any node in N{A) creates a link with nodes in Xi 
that crosses AB. If such a crossing occurs, link AB is removed and A sends 
a message to B to remove the link as well. Similarly node B performs a 
check of nodes in N{B) for a crossing of the link AB and informs node A if 
a crossing is found and AB is to be removed. 




Fig. 3. Morelia Test 
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3. If there is at least one node in Yi and no node in I2 U U X2 (symmetric 
to Rule 2) then the node A checks whether any node in N{A) creates a 
link with nodes in Yi that crosses AB. If such a crossing occurs, link AB is 
removed and A sends a message to B to remove the link as well. Similarly 
node B performs a check of nodes in N{B) for a crossing of the link AB and 
informs node A if a, crossing is found and AB is to be removed. 

4. If there is at least one node in X2 and no node in Yi U Y2 then the node A 
inspects the nodes in N{A) to check whether any node there creates a link 
with nodes in Xi UX2 that crosses AB. If such a crossing occurs, link AB is 
removed and A sends a message to B to remove the link as well. Furthermore 
A sends a message to nodes in X2 with a request to send back information 
whether there is a node in the region Y3. If A receives a message that a node 
exits in I3 then AB is removed and node B is informed to remove the link 
as well. 

5. If there is at least one node in Y2 and no node in X\ U X2 (symmetric to 
Rule 4), the node A inspects the nodes in N{A) for a possible crossing of 
AB. If such a crossing occurs, link AB is removed and A sends a message to 
nodes in Y2 with a request to send back information whether there is a node 
in the region X^. If A receives a message that a node exits in X^ then AB 
is removed and node B is informed to remove the link as well. 




Fig. 4. Examples of the Morelia test situations: a) AB deleted in Rule 1, b) AB deleted 
in Rule 2, c) AB deleted in Rule 4 



Figure 4 illustrates how, unlike the Gabriel test, the Morelia test will check 
for crossings prior to eliminating a link. It will eliminate link AB because it 
detects crossings, but it will not eliminate it when no crossing exists. 

Notice that in the Rule 1 of the Morelia test, nodes A and B look only at 
nodes that are in iV(A) and N{B) respectively, i.e., nodes that are one hop away. 
In Rule 2, 3, and 4 node A checks nodes in N{A), node B checks nodes in N{B) 
and both A and B possibly check N{x) for nodes a; in € X2 U Y2. Thus A ox B 
are checking nodes that are at most two hops away. Therefore, the Morelia test 
is a local test. 

Theorem 1. If network N is connected then the application of the Morelia test 
to all links of N produces network N' which is a connected planar spanner of N. 
Furthermore, N' contains the Gabriel graph of N as its subgraph. 
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Proof. Every edge in N that is kept by the Gabriel test is also kept by the 
Morelia test. Thus, N' contains the Gabriel graph of N as its subgraph. Since 
the Gabriel test produces a connected spanning subgraph of N, the subnetwork 
N' is connected. 

Assume that there is a link e in N' that crosses a link AB of N' . Since the 
length of any link in N is at most R, both ends of e are in N {A)U N {B)U X^UY^. 
If one of the ends of e is in the circle with diameter AB then AB would be deleted 
by the Morelia test. If both ends of e are outside the circle with diameter AB 
then one of the ends of the edge AB must be inside the circle with diameter e, 
since the edges cross each other. However, the Morelia test applied to e would 
eliminate e because of it being crossed by AB. Thus there can be no crossing 
in N'. 

In Figure 5 we show how the Morelia test keeps some of the long links. The 
left-hand side shows a link AB and the nodes inside the circle with the diameter 
AB and the right-hand side shows what edges are kept by the Morelia test. 





Fig. 5. Multiple hop elimination (line segment AB) via the Morelia test 



3.2 The Overhead of the Morelia Test 

In Rule I of the Morelia test node A or B needs to determine what nodes are in 
the circle with AB as its diameter and the complexity of it is the same of that 
of the Gabriel test. 

Rule 2 for A involves nodes that are in N{A) and each node of the network 
needs to know its neighbors anyway. To check for crossings of two line segments 
involves simple geometrical computation of complexity 0(1), and the same ap- 
plies to node B. The exchange of messages between A and B confirming deletion 
or retention of the edge is also simple. 

Rule 3 and 4 of the test involves the existence of nodes in A3 and Y3 that 
are outside of N{A) U N{B). However, all that is needed for A or B is to send 
a message to the nodes in region A2 or I2 asking the question “is there any 
node in Tj or A3?” respectively. The region A3 or Y^ can be specified by the 
three corners of the region. Thus, although these rules involve nodes that are 
two hops away from A and B, it does not create a significant overhead or delay. 
We show now that the size of the region A2 or Y2 is a smaller part of the circle 
with diameter AB. Since Rule 3 and 4 are used only when A2 or I2 is the only 
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Fig. 6. The arc segment Y 2 (and its symmetric one X 2 ) used in Rule 5 



region of the circle containing a node of the network, the probability of using 
these rules is also smaller. 

We can easily calculate an upper bound on the ratio of \X 2 \, the area of 
X 2 to |Xi|, the area of X\ to get an indication on how often Rule 4 is used in 
comparison to the other rules. 

It is easy to see that the ratio |X 2 |/|Xi| is getting smaller when the angle (f) 
is getting smaller. Thus we get an upper bound on |X 2 |/|Xi| by setting (j) = tt/ 3. 
In this case the length of AB equals R and the area of the circle determined 
by the diameter AB is exactly 7ri?^/4, and the area of the circular sector Xi is 

equal to ttR^ - \/(3)i?V4 = Thus ^ = 

^ V V ; / V 12 |JCi| ,rfl:2/8_2^_3^3)/12 



< 0.3. 

StT — 47T — 6-^ (3) 

Thus if a node is located in the circle with diameter AB and its placement 
is random, it will be in the area X 2 U I 2 with probability less than 0.3. 



4 Experimental Results 

In this section we present our simulations designed to test the Morelia Test in 
comparison with the Gabriel Test. The purpose of the simulations was to give an 
indication on how our algorithm actually performs in the real world. We study 
the operation and performance of our algorithm on randomly generated graphs. 



4.1 Goals 

There were two main goals to achieve with our simulations. The first was to 
compare the average link length in the planar spanning subgraph created using 
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the Morelia and Gabriel tests. The second, to compare the number of hops 
required to deliver messages with face routing on the planar spanning subgraph 
created by the Morelia and Gabriel tests. 

4.2 Network Model 

A simulation was designed that allowed the creation of a special type of random 
graph. Since an ad-hoc network does not assume any specific structure, we con- 
sidered network models obtained as follows. A square area (called a grid) could 
be defined, nodes could be created with a specific transmission range and placed 
at random coordinates on the grid. We use the unit disk graph (UDG) model, 
where all nodes have the same transmission range and the nodes were uniformly 
distributed. 

Each network simulation consisted of 50 nodes each with a transmission 
range of 250 units. The test area was a square grid on which the 50 nodes were 
randomly placed. This network setup allowed us to test the performance with 
different link densities by simply varying the grid size (300 units square to 1300 
units square). 

4.3 Analysis of Morelia Graph vs. Gabriel Graph 

The Gabriel Test and the Morelia Test were run on the same series of random 
graphs. The following are the main metrics that were traced in the simulation: 
1) Average length of link on each planar spanning subgraph; and 2) Number of 
links in each planar spanning subgraph. 

Scenario 1 ~ Uniform Random Graph. The first test, was to generate a 
series of 30 random networks. The grid size was the only parameter that was 
altered for a series of trials. This had the effect of increasing or decreasing the 
network density (the smaller the grid size, the closer together the nodes, the 
greater number of links, the higher degree vertices). 



a) Average Length of Links in Planar Spanner (50 Nodes with 250 unit radius) 
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b) Average Number of Links in Planar Spanner (50 Nodes with 250 unit radius) 
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Fig. 7. Average Number and Length of Links in Morelia and Gabriel Graphs 



Figure 7 a), shows the average link length of the Gabriel and Morelia graphs. 
The Morelia graph indeed does keep slightly longer links on average. The average 
increase in length was 4.86 units or 5.76%. 
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Figure 7 b), shows the average number of links kept in the planar spanning 
subgraph for both tests. The Morelia test keeps more links on average in the 
spanner than the Gabriel test. The average increase in links kept in the Morelia 
graph was 6.07 or 7.49% over the Gabriel graph. 



Scenario 2 Random Graph with a Sparse Region. This simulation was 
designed to test the planar spanning subgraph creation when there exists an 
area of the graph which has a less dense topology. A region of 500 units within 
the test grid was defined to have a node density of one-half that of the rest of 
the network. 

Here we report, as in the first test, the resulting average link length and 
number of nodes in the planar spanning subgraph. In Figure 8 a), we see the 
average length of links (edges) kept in both the Gabriel and Morelia graph. The 
average Morelia link length is now 5.57 units or 4.5% longer than the average 
Gabriel link. 



a) Average Length of Links in Planar Spanner (50 Nodes with 250 unit radius) 



b) Average Number of Links in Planar Spanner (50 Nodes with 250 unit radius) 
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Fig. 8. Average Number and Length of Links in Morelia and Gabriel Graphs 



Finally, we show in Figure 8 b), the average number of extra links kept in 
the two graphs. The average number of additional links kept increased by 7.67, 
which i an increase of 10.33% 



4.4 Face Routing Performance Study 

In this section we present the simulation results of the face routing algorithm 
Face-2 [2] on the same series of random graphs on which the Morelia and Gabriel 
tests have produced a planar spanning subgraph. The Face-2 algorithm is a 
modified face routing algorithm that at each iteration of a face traversal makes 
the decision to move to the next face when it determines a link is about to cross 
the line from source to target, instead of traversing the whole face and keeping 
track of all crossings that occur [2] . 

We want to show that, in a network with some sparse areas, the face rout- 
ing algorithm will perform routing with fewer number of hops on average. Two 
scenarios were simulated. The first was on a uniform random graph, the latter 
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a) Average Number Of Hops (50 Nodes with 250 unit radius) 



b) Average Number Of Hops (50 Nodes with 250 unit radius and 500 unit 'hoie') 




Grid Size (units x units) 




Fig. 9. Face routing applied to Morelia and Gabriel Graphs 



scenario was on graphs which had a predetermined sparse region. In these sim- 
ulations, the only metric studied was the average number of hops required for 
the face routing algorithm. 

In the first scenario, a uniform random network of 50 nodes was built. Each 
node had a transmission range of 250 units. As before, the grid size was the only 
parameter changed to vary the network density. 

We see in Figure 9 a), that in a uniformly distributed network, only minor 
gains are made in terms of decreasing the number of hops. The average number 
of hops saved was 0.28 per route. 

In the last scenario, we define a region 500 units square and set the number 
of nodes with the region to one-half that of the first test. Routes were chosen 
that would traverse the region. All other parameters remained as in the previous 
scenario. 

In Figure 9 b) we can see that there is further improvement of the face routing 
algorithm. The average number of hops saved with face routing on the Morelia 
graph has increased to 2.03 hops per route or 10.6%. 

5 Conclusion 

We have shown that with the application of the Morelia test to the underlying 
structure of a wireless ad-hoc network we can achieve face routing with a fewer 
number of hops on average over the Gabriel test when the network has a sparse 
region Thus the use of the Morelia test would be beneficial in any wireles ad-hoc 
network having a complex topology. 

There are several open problems for future work in this area. One such prob- 
lem is the extension of the algorithm to a network which is changing dynamically. 
Another area to investigate would be the improvement of the application of the 
Morelia test in networks which are very dense. If a network is very dense, i.e. the 
number of nodes inside a circle of transmission radius R is very high, the Morelia 
test introduces more overhead than the Gabriel test, the increase in th number 
of long links kept is not significant, an and thus the number of hops from source 
to destination remains high. Finally, a study of the Morelia test in combination 
with other tests that reduce the initial density of links in a network. 
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Abstract. Path Layout is a fundamental graph problem in label switching pro- 
tocols. This problem is raised in various protocols such as the traditional ATM 
protocol and MPLS which is a new label switching protocol standardized re- 
cently by the IETF. Path layout is essentially the problem of reducing the size 
of the label-table in a router. The size is equivalent to the number of different 
paths that pass through the router, or start from it. A reduction in the size can be 
achieved by choosing a relatively small number of paths, from which a larger set 
is composed using concatenation. 

In this paper we deal with three variations of the Path Layout Problem according 
to the special characteristics of paths in three label switching protocols, MPLS, 
ATM and TRAINEE We focus on tree networks and show an algorithm which 
finds label-tables of small size while permitting concatenation of at most k paths. 
We prove that this algorithm gives worst case tight bounds (up to constant factor) 
for all three models. The bounds are given as a function of the size of the tree, 
and the maximum degree. 



1 Introduction 

Improving label switching technologies is a step towards better QoS (Quality of Ser- 
vice) in high speed and IP networks. Label switching is done by using a quite simple 
idea of label based switching decisions, a packet is switched in the network according 
to a label it carries. The concept of Label Switching is not new. Its origins are in virtual 
circuit networks such as ATM networks. However, only recently the combination of 
switching technology and IP routing became a popular avenue of research and develop- 
ment. The large number of different label switching protocols [7] (tag-switching [18], 
CSR [15], IP-Switching [17], and threaded indices [6]) motivated the IETF to develop 
MPLS (Multi Protocol Label Switching), a standard approach to label switching [5]. 
MPLS was designed to combine the best of several approaches. 

MPLS was developed to overcome some drawbacks of the standard IP routing. IP 
routing is basically a simple method of hop by hop routing. When a packet reaches a 
router, the routing decision (on the next router in the path) is based only on the des- 
tination address of the packet. At any given point in time all packets with the same 
destination address would be routed in exactly the same way, regardless of any other 
parameter of the packet. At each router along the packet path, the destination address 
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of the packet is examined, and the longest prefix that matches the address, out of all the 
prefixes (usually several tens of thousands) in a forwarding table is found. The packet 
is then routed according to the information associated with the Best Matching Prehx 
(BMP) in the forwarding table. 

The high cost of computing the BMP at each router combined with the inability to 
distinguish between different flows with the same BMP motivated the development of 
MPLS. All the packets of the same flow are tagged with a unique label upon entering 
the network layer. At each switch (router) on the path the label is used to switch (rather 
than route) the packet to its next hop. The label is used as an index in the switching 
table. That is, label switching supports the concept of “route once and switch many”. In 
other words, for all the packets in a flow, routing is performed once when setting up the 
label path, and the label is then used for all the packets at all the intermediate switches. 
Essentially MPLS establishes a circuit between the source and destination which in the 
terminology of MPLS is called Label Switch Path (LSP). 

The basic advantages of MPLS are as follows. Elimination of the time consuming 
BMP enables higher rates of packet processing. The second advantage is support of QoS 
forwarding by assigning different labels to flows that require different services even if 
destined to the same destination (we do not use such options in this paper). Finally, 
MPLS enables explicit routing. The source router in the path establishes the path by 
some setup mechanism (such as RSVP). Henceforth the forwarding is performed by 
switching on the label which was associated with the explicit route. The explicit routing 
feature of MPLS enables the support of good traffic engineering (design of paths in the 
network), load-balancing, and support for resource reservations. 

In order to support hierarchical routing (between networks and inside them), MPLS 
supports multi label paths by placing a stack of labels on each packet of a correspond- 
ing flow. In addition, in MPLS each router may use operations on the stack of labels, it 
can be instructed to push or pop one or more labels. This mechanism is also useful for 
path concatenation [1, 3]. A path is described by concatenation of LSPs and each LSP 
described by a label in the stack label. One of the applications of path concatenation is 
using a small set of LSPs to compose a larger set. The main advantages of this applica- 
tion is reducing the number of labels necessary in the network thus enabling more label 
switched routes with the same number of labels. Labels in MPLS are a scarce resource. 
They are the key elements in the scalability of MPLS, and the more labels we need the 
larger the switching tables are. Since the switching has to operate at high speeds, it is 
built out of expensive memories and its size is critical. 

This paper deals with the following fundamental graph problem. How to choose a 
small set of routes in order to compose a larger set of routes by using concatenation. We 
mainly focus on tree networks, which are relevant to many server client applications, 
such as multicast, and can also be used as a building block for solutions in general 
graphs [9]. 

This question is not new. This is the basically Virtual Path Layout in ATM, that 
was considered in the literature [21, 19, 13, 2, 20, 8, 4, 12, 9, 10] in the blossoming year’s 
of ATM protocol. ATM [14] is also protocol based on circuits, where a circuit is a path 
defined between a source and a destination. In ATM, a VC or a Virtual Channel connects 
between users and may concatenate many VPs (virtual paths). The main rule of VP is 
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an internal network usage, of reducing the cost of management the establishment of 
VCs. The concatenation of paths in ATM is done by using the VCI/VPI labels, where 
VCI is the index of a VC and VPl is the index of a VP. Basically the VCI is the glue 
that concatenates several VPIs. 

Our first contribution in this paper, is giving tight (up to a constant multiplicative 
factor) upper and lower bounds for the worst case maximum load on any router in a tree 
network. This result also shades some light on the corresponding open issue in ATM 
networks. 

The second contribution in this paper, is to model the problem of path layout in three 
different label switching models, ATM, MPLS and a new suggested extended version of 
MPLS, Trainet. We examine the different power of the models by demonstrating them 
on the problem of path layout on tree networks. 

One of the key differences between MPLS and ATM is caused by the path merging 
capability in MPLS. MPLS permits tagging all the packets with the same remaining 
path with the same label, even if there initial path was different [5]. Hence when a label 
switch path is built in the network, one may use all its, with no extra charge of building 
and tagging these paths with different labels. 

The third model, the Trainet Model, is a model suggested by [1]. The main idea, 
of this extended MPLS model, is to increase the utilization of each label by using it to 
define each sub-path of a labeled switched path (LSP) as a legal route. It is achieved by 
replacing the label with a <label,counter> pair. At each switch the counter shows how 
many more hops the corresponding packet should take on this LSP. This is much like 
taking a subway, where the packet has to go off the train after a specified number of 
stations. This model resembles the idea of Manhattan networks [16]. 

The outline of the rest of the paper is as follows. The different models are formalized 
in Section 2. We discuss related work in Section 3. The main result of the paper is 
presented in Section 4. The algorithms for different cases are given in Section 5. The 
lower bounds for different cases are given in Section 6. Section 7 explores some results 
for the different models, which are derived from the main result. We finish the paper 
with some conclusions. 

2 Model 

All label switching models establish switch-paths. A packet moving in a network, may 
move only along a switch-path. Each packet is assigned a label, corresponding to the 
switch path it is supposed to use . A packet traverses its path using a switching tech- 
nique. We begin with defining switch-paths. Virtual Paths (in ATM), Label Switch Paths 
(in MPLS) and Trainet Switch Paths (in Trainet) are all instances of a switch-paths. 

We model the network, as a bidirectional graph G = (V,E), where the nodes 
are routers and the edges are physical connections between the routers. In some cases 
(where we are interested in paths from a single source) we consider directed graphs. 

Definition 2.1. A switch-path SP in graph G is a simple directed path. 

The label switching models differ in the number and the type of different routes 
(sub-paths of switch-paths) that a packet may traverse using a specific switch-path of 
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the specific model. Metaphorically, the switch-path resembles a directed express train- 
line, which the packets can use for fast switching. The routes that packets may use 
are all sub-paths of the train line. The ATM switch-paths, called also Virtual Paths, 
define only one route each. In order to use the path, a packet must begin its trip in the 
first station, the source of the switch-path, and may leave only in the last station, the 
destination. In MPLS, the packet may enter the train-line at any station, but must stay 
till the end of the switch-path. A label switch path in MPLS defines a subset of routes, 
each of which corresponds to a suffix of the label switch-path. In the Trainet model, 
a packet may enter the train-line at every station, and get off at any later station. A 
Trainet switch-path defines a set of routes, each of them is some sub-path of the switch- 
path i.e., each path in the Trainet model introduces all its sub-paths as routes. Next, 
we give definitions, which formally define the routes that are induced by each type of 
switch-path. 

Definition 2.2. Let V be a Virtual Switch Path. We define the set of routes that are 
induced by V SP in the ATM model, simply as {V}. 

Definition 2.3. Let L be a Label Switch Path. We define the set of routes that are in- 
duced by L in the MPLS model, as the set of routes {r\r is suffix of L}. 

Definition 2.4. Let T be a Trainet Switch Path. We define the set of routes that are 
induced by T in the Trainet model, as the set of routes {r\r is (a non-empty) sub-path 
ofT}. 

In the next definition, we define the path layout in a Model M (where M can be 
ATM, MPLS or Trainet). 

Definition 2.5. Let G = {V, E) be a directed graph. Let P be a collection of routes in 
G. A collection of switch-paths P' in G is a fc-hop path layout of P in the model M, 
iff for every r G P, it can be created by concatenating of at most k valid routes induced 
by P' in the model M. 

Our paper focuses on finding a path layout that minimizes the size of the largest 
label-table (or the switching table) of any router. A label-table of a router is the table 
that contains the database of the labels. The size of this table for a given router vertex 
is also called the vertex load. Next we give a formal definition of vertex load in the 
different Models. Similarly to the load of a vertex, one can define the load on an edge. 
We prefer to discuss the vertex loads due to their direct connection to the sizes of the 
label-tables. The vertex load of node v is denoted L{v). 

Definition 2.6. Let V SP be the set of all the virtual paths in an ATM networks. Let 
VSP{v) be the subset of paths p £ VP such that v £ p. L{v) = \VSP{v)\. 

Definition 2.7. Let LSP be the set of all label switch paths p in a MPLS network. Let 
LSP' be the set of all routes induced by LSP. Let LSP'{v) be the set of all routes p 
in LSP', such that v is the first vertex in p. L(v) = \LSP'(v)\. Note that this definition 
captures the merging of labels capabilities of the MPLS model. 
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Definition 2.8. Let TSP be the set ofTrainet switch paths in a Trainet Network. Let 
TSP' be the set of all routes induced by TSP. Let TSP'{v) be the subset of all 
routes in TSP' such that v is the first vertex in p. Let TSP"(v) be a minimal sub- 
set of TSP' {v) such that for each p' £ TSP'{v) there exists a route p" £ TSP"{v) 
such that p' is a prefix of p" . L{v) = \T S P" (v)\. Note again, that this captures the 
merging of labels capabilities of the Trainet model. 

We say that a path layout has vertex load m if the maximum load on any vertex in 
the graph is m. Our paper deals with finding a fc-hop path layout, that requires minimum 
vertex load, and hence minimizes the memory used for the incoming lahel-tahle at the 
router. The paper focuses on the case of a tree network where the basic set of paths P is 
naturally a set of shortest paths. In the following dehnitions, we define three problems 
of path layout in a tree, one-to-all path layout, all-to-one path layout, and all-to-all path 
layout. 

Definition 2.9. Let r be a designated vertex in graph G. A one-to-all path layout is a 
path layout where r has shortest path routes to all other nodes in G. 

Definition 2.10. Let r be a designated vertex in graph G. A all-to-one path layout is 
a path layout where all the vertices in G have shortest path routes to r. 

Definition 2.11. A all-to-all path layout is a path layout in G, where every node has a 
shortest path route to every other node in the graph. 

Throughout the paper, the symbol n denotes the number of nodes in the network. 
The size of a tree is defined to be the number of nodes in it. 

3 Related Work 

While there is extended literature about the virtual path layout in general ATM networks 
[2, 20, 8, 4, 12, 9-11], there is no work, to the best of our knowledge, that deals with the 
new model of label switch path layout. 

The paper of Gerstel, Cidon and Zaks deals with the problem of virtual path layout 
in a tree network [9]. In that paper, an optimal polynomial algorithm for finding a virtual 
path layout (in ATM model) with minimum edge load was introduced. Gerstel and Zaks 
[13] provided a greedy algorithm that constructs a layout with smallest possible vertex 
load. However, to the best of our knowledge, there is no tight bound known for vertex 
loads in tree networks. The paper [13] does not analyze the resulting optimal vertex 
load found by the algorithm. The results of [9, 13] are important and raise many further 
questions. There is practical need in knowing the maximum resulting load, the reason 
to study the vertex load is that it is equivalent to the maximum size of the label-table of 
any vertex. Our paper, fills this gap, and gives a lower bound for the minimum vertex 
load that any algorithm can hnd. We feel that our lower bound, is an important step in 
reaching a real understanding and estimating the gain by using path concatenation. 

Our paper also introduces polynomial algorithms, which give a tight upper bound. 
The paper [10] gives an upper bound of 0{y/n) of edge load for the 2-hop path layout, 
which leads to vertex load of 0{dy/n). In this paper we give algorithms for the general 
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/c-hop path layout problem. Specifically, for the special case of the 2-hop path layout, 
our algorithm gives a vertex load of only 0{\/dn). 

To the best of our knowledge, our work is also the hrst paper to model the virtual 
path problem in the MPLS and Trainet models. 

4 Main Results 

In the rest of the paper, we prove the following theorem, and derive some results on the 
different models from it. We consider a network which is a rooted directed tree. We de- 
hne d to be the maximum out-degree of any node in the directed tree. It is also possible 
to see the tree as undirected. This does not change the result since the maximum degree 
in that case is at least d and at most d + 1 and a change in d by a multiplicative factor 
of at most two does not affect the results. I.e., the following theorem gives tight bounds 
on the worst case vertex load for the one-to-all fc-hop path layout in tree networks. The 
theorem holds in all three models. 

Theorem 4.1. Given a rooted directed tree with n nodes and maximum degree d. For an 
integer fc > 1, the following function fk describes the maximum size of the switching 
table of any node (i.e, the vertex load), for a fixed value of k (maximum number of 
concatenated paths). 

Ifk is even then 

{ &(n'^d~^), ford £ [1, n^+i ]; 

0((nd) '=+2 ), for d £ [n '=+i , n'^]; 

0(d), for d £ [n^,n]. 

If k is odd then 

{ e(n^d^), ford£[l,n^]; 

0(n '=+1 ), for d £ [n'^+T , n'^+T]; 

0(d), for d £ [n'“+i ,n]. 

There is one exception which is the case d = 1 in the Trainet model, for which 
fk(n,d) = 1. 

To prove the theorem, we need to show algorithms for all cases, and to give negative 
examples to all cases as well. We do that in the next sections. 

5 Algorithms 

We start with designing the algorithms, and construct the lower bounds in the next 
section. We dehne a procedure DECOMPOSE. This procedure is used by the algorithms 
for all the different cases. The procedure receives a rooted directed tree T' of size m, 
and an integer £ < m and returns a node v. v is the root of a minimal sub-tree of size 
at least £ (we require the sub-tree to contain all the descendants of v in T'). Note that 
since the sub-tree is minimal, the size of the sub-trees of the children of v are smaller 
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than £. The operation of DECOMPOSE is iterative and simple; It moves a pointer along 
a descending path in the tree. The pointer starts at the root of T, and goes along a 
directed edge to a child whose sub-tree is of size at least £, as long as such a child 
(whose sub-tree is large enough) still exists. We later show how to combine several 
iterations of decompose into a single run of DES. 

All algorithms have a similar general structure. The original tree T is decomposed 
recursively into smaller and smaller trees, until the trees are small enough. In each 
decomposition step, some switch-paths are defined. 

Case 1: An algorithm for even or odd k and d G [1, n^+r] 

Let £q, , £k-i be a monotonically decreasing sequence such that £q = n. The values 

£i serve as thresholds for sizes of trees (numbers of nodes). We define sets of roots 
i? 0 j • ■ • Rk -1 in the following way. i?o = {f}, where r is the root of T. Ri contains 
nodes, which are defined to be roots of sub-trees (those are sub-trees which contain a 
subset of their descendants), such sub-trees have sizes of at most £t. Note that the sets 
are not necessarily disjoint. Moreover, each vertex rj G Ri, has an associated vertex 
in Ri-i, which is expressed by a function fi : Ri ^ Ri-i- We explain below how to 
create the sets and the functions. 

The algorithm uses the term “trees of type i”. Those are trees whose root is in Ri. 
Clearly the size of a tree of type f (0 < t < fc — 1) is of size at most £i. The original tree 
T is a type 0 tree. A root of a tree of type i, i.e. a node of Ri, is called a root of type i. 
The root of T is also called the main root. 

Decomposition into smaller trees: The algorithm uses the set of thresholds to recur- 
sively define sefs of trees of type i for each i. Starting with T (which is the only tree of 
type 0), trees of type i-\- \ are simply defined by decomposing trees of type i. 

The structure of switch-paths: Switch-paths are defined in a way that for each root r 
of type either there exists a root of type i (/i+i (r)) with a switch-path going to r, 
or r itself is a type i root. The exact way to do this is defined together with the defining 
the decomposition algorithm. 

Einally each root of type A: — 1 is defined to have switch paths to all the nodes in 
its sub-tree (of type k — 1). The value of £k-i has to be relatively small so that roots of 
type A: — 1 do not have a large vertex load. 

Using the above set of paths we get that the set of all concatenations of at most A: — 1 
switch-paths from the main root, allow us to get to every root of type A; — 1. Using the 
switch paths from roots of type A; — 1 we get that it is possible to use a concatenation of 
at most k switch-paths to get from the main root to any node. 

An algorithm for decomposing a tree of type i into trees of type i 1: Given a tree 
Ti of type i, fix = n and do fhe following until the tree Ti is left with 

at most £i+i nodes: (If already \Ti\ < £i+i, then the tree is not decomposed further). 
Run DECOMPOSE with the parameter on the tree, and get a sub-tree rooted at x. 
Add all paths from the root of Ti to x and its direct children to the set of switch-paths, 
and remove this sub-tree from Ti. Let x be the root of the removed sub-tree. Each direct 
child y of a; is defined to be a root of type z -f 1 (the tree of type z -|- 1 contains all the 
descendants of y that were not removed previously from the tree). Since the sub-tree 
of X is minimal, the size of the sub-tree of each child of x is of size at most £i+\. The 
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remainder of Ti (after all removals) is also defined to be a tree of type i+ 1. Recall that 
by definition, the remainder is of size at most £i+i. 

Analysis of the vertex load: The number of times that DECOMPOSE was run (for a 
given i) is at most Consider the labels given only while decomposing trees of type 
i into trees of type i + 1. Eor each sub-tree of type i + 1, at most <i+ 1 new switch-paths 
are defined, hence the root of Ti is assigned at most {d -L 1 ) 7 ^ new labels. Using 

£i = c?'^, we get = n^d~^ and the number of new labels assigned to each 

root of type i < fc — 1 is at most (c? -I- l)n^d~^ = 0{n^d^~ ) labels. For i = k — 1, 
the size of each tree of type A: — 1 is at most Ik-i = n^d ^ , hence each root of type 

1 fc — 1 

k — I also has 0{nkd ~^ ) new labels defined for switch-paths to its descendants. To 
calculate the total number of labels that any node has received, note that each node in 
the tree T belongs to at most one tree of each type. While considering trees of type i, 
all switch-paths were defined inside the trees. In the worst case, a node in a tree of type 
i belongs to all switch-paths starting at its root. Since there are k types of trees, and k 
is constant, each node has at most 0{nid ~<^ ) labels of different paths. 

1 

Case 2: Algorithm for odd k and d > 

The algorithm is similar to the previous algorithm. We specify the differences between 
this case and case 1. Instead of k types of trees, there are only types. The number 
of thresholds is naturally also and again £q = n. Clearly, the number of recursive 
decomposition phases is also smaller. 

The structure of switch-paths: For each root of type i -\- 1 {i > 0) there exists a 
concatenation of at most two switch-paths to it from some root of type i (the root of 
type i-\-\ may be already a root of type z, this can be seen as a special case). In the trees 
of type there is a switch-path from each root to all the nodes in the sub-tree. 

The maximum number of switch-paths that need to be concatenated to be able to get 
from the root to some node is two times the number of phases of decomposition, plus 
1 (for the switch-paths from every root of type to its descendants). This number is 
exactly k. 

An algorithm for decomposing a tree of type i into trees of type z -|- 1.' We set 

k-2i-l 

= zz z+i . To decompose a tree of type z, Ti, into trees of type z -|- 1, we again 
run DECOMPOSE with the parameter £i+i, removing each tree until \Ti\ < £i+i. Let 
a; be a root of a removed sub-tree. A switch-path is fixed from the root of Ti to x, and 
switch-paths are also fixed from x to all its direct children. The sub-tree of each direct 
child is defined to be a tree of type z -|- 1 . The remainder of Ti after all removals is also 
defined to be a tree of type T^+i. 

Analysis of the vertex load: To calculate the number of labels given to a node while 
considering trees of type z, note that there are two cases. Nodes got labels either on a 
switch-path from a root of type z, or on a switch-path of length 1 (to roots of type z -|- 1 
from their direct parents). In the first case, a root of type z is assigned at most labels 

for z < and at most £i for z = This equals to n'^ . In the second case, a node 
is defined to have switch-paths going to its children, which gives at most d labels for 
a node. Hence the maximum number of labels given to one node is 0(max(ci, n^+r)). 
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Using similar reasoning as in the first case, the total number of labels given to one node 
in total is also 0(max(ti, )). 

1 2 

Case 3: Algorithm for even k, d G , n^] 

We use a combination of the first and the second algorithms. We omit the details 
Case 4: Algorithm for even k, d> ni 

Let a = \ — 1- We run the second algorithm with a + 1 types of trees (-% = %, for 
0 < z < a. We omit the details 

Running time: We assume that the algorithm needs to gather information, but does not 
need to construct the paths. The decomposition of a tree of type z into trees of type z + 1 
can be done e.g. while running a Depth First Search on the tree. Each node may have a 
counter of the size of its sub-tree. As soon as the search returns to a node with enough 
descendants, switch-paths from the root are defined, and the counter is set to zero. We 
run this once for each tree (may be done recursively, running a new DFS when a node 
is declared to be a root of some type). The running time is 0{n) for each type of trees 
(each node or edge belongs to at most one tree of each type). The total running time is 
0{kn). We can conclude that the running time is linear in n. 

The upper bounds above assume that fc is a constant, which is the case in reality. 
However, even if k is non-constant, it is possible to get the same upper bound in the 
Trainet model. The details are omitted. 

6 Lower Bounds 

The lower bounds presented in this section are valid for all three models. Whenever, we 
use routes, in the proof, the routes are induced by the switching path according to any 
of the models. To prove lower bounds we introduce several trees in which at least one 
node has a large switching table. 

If d = 1, then the tree is a chain. We need to show that there exists a node with 
I7(n'^) labels in the ATM and in the MPLS models. Assume to the contrary that all 
nodes have a table of size at most |n ^ . Starting at the root of T, using a concatenation 
of exactly i routes, the root has access to at most different nodes. Using a 

concatenation of at most k routes we get access to at most 1 -|- (|zz't) -|- -I- 

. . . < 1 -I- |n -I- (|)^n -I- . . . < 1 -I- ^ < n (for n > 3). Contradiction. 

For other cases, we build trees of sizes at most n that give negative examples. 

Case a: Lower bound for d G [2, n'^ ] 

We define the following tree types. Let n' = zz/2. Let Ti be a maximum complete 
binary tree with at most leaves and T 2 a maximum complete binary tree with at 

most leaves (if < 1, or < 1 use a tree which consists of a single 

node instead of a binary tree. Note that < n). 

Let T[ (resp. Tly be a T\ (resp. T 2 ) tree such that each leaf has d children. We 
define a concatenation of two trees S\ and S 2 as follows : S 1 S 2 is rooted at the root of 
Si and each leaf of S\ has a 52 as a sub-tree (the leaf of the S\ is a root of an 82 )- Let 
T = (See Figure 1). Note that T has at most n' leaves, and at most 2rz' = n 
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Fig. 1. Illustration of the lower bound tree of Case a 



1 fc — 1 

nodes. We need to show that some node has n{n'^d~^ ) labels. For a set of vertices Q 
in a tree T, let D{Q) denote the set of all descendants of nodes in Q (including Q i.e. 
Q Q D{Q)). For a tree T with a given switching table, where the root can reach any 
node concatenating k routes or less, let T{i) {Q < i < k) denote the set of nodes that 
are reachable from the root by at concatenating at most i routes, or some descendant of 
the node is reachable. It is easy to see that T(0) is the root and T{k) is the set of all 
nodes. 

Assume that at most ) labels for the root and at most ) labels 

for any other node are enough. We will reach a contradiction which would mean that 
some node needs I7(ni ) labels. Let /q be the set of leaves of the tree T[, and let 
li be the set of leaves of T{(T 2 )® for i = 1 , .., fc — 1 . 

We can prove the following claim using induction if we assume that the root has at 
most |(n' '= d~’^ ) labels, and each other node has at most d ~>^ ) labels. 

Claim. For every 0 < i < fc — 1, the set contains a subset /' which satisfies the 
following properties: 1 . |/'| > f- 2 . All vertices in /' have the same parent node. 3. 
T(i + 1) n D{I[) = 0 (i.e. it is impossible to get from the root of T to any of the nodes 
in the subset or any nodes in their sub-trees by concatenating at most i + 1 routes). 

The claim shows that there exists a leaf node of T which does not have a concatena- 
tion of fc routes from the root which leads to this leaf. This is a contradiction. (Actually 
we showed that there are at least | such leaves). 

1 2 

Case b: Lower bound for even fc and d G [n fc+r , n^] 

We build the following tree: Let n' = n/2. Let T 3 be a maximum complete binary tree 
with at most leaves (this is well defined since d < ni). Let T 3 be a defined 

d2 

fc 

analogously to T/ in previous proof. Let T = (T 3 ) 2 T 3 . Note that T has at most n' 
leaves, and at most 2n' = n nodes. Define J 2 i-i to be the leaves of {T^Y (1 < i < |), 
Jfe be the leaves of T and let J 2 i be the parents of J 2 i+i (0 < i < | — 1). 

Assume to the contrary that at most i(n'd )^+2 labels for the root and at most 
^{n'd) ^+2 labels for any other node are enough. 

If the root has at most | (n'd) labels and each other node has at most | (n'd) 
labels, we can prove the following claim, which is similar to the claim for case a. The 
proof is omitted. 
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Claim. For all odd i, there exists a set J' C Ji which satisfies the following properties: 
1-141 > ^.2. All vertices in J' have the same parent node. 3. T(z) n D{J[) = 0 (i.e. it 
is impossible to get from the root of T to any of the nodes in the subset or any nodes 
in their sub-trees by concatenating at most i routes). 

For even i > 0, there exists a set J' C Ji which satisfies the following properties 
1. All vertices in J' have the same ancestor node z in Ji_i, i.e. they belong to a single 
T 3 tree. 2. \ J[\ > ^\D{{z}) n Jt\ (i.e. contains at least half of the leaves of some T 3 
tree). 3. T{i) n D{J'^ = 0 (i.e. it is impossible to get from the root of T to any of the 
nodes in the subset J' or any nodes in their sub-trees by concatenating at most i routes). 

Using the claim for i = k we get that there exists a leaf of T which is not reachable 
from the root by concatenating k routes. Contradiction. 

Case c: Lower bound for odd k, d £ , n fc+r] 

2 

Let T 4 be a maximum complete binary tree with at most leaves (note that this 

value is larger or equal 1). Let be defined analogously to T{. Let T = The 

proof for this tree is very similar to the previous proof. 

We omit the lower bound proof of the other cases. 

7 Results in the Different Models 

In this section we show two corollaries which adapt the basic algorithm to give an all- 
to-all path layout in general tree networks for the Trainet Model, and for directed tree 
networks in the MPLS model. The proofs of the claims in this section are omitted. 

The special properties of the Trainet model allow us construct an all-to-all path 
layout with no extra cost. 

Claim. Given a bidirectional tree. In the Trainet model, it is possible to build a /c-hop 
all-to-all path layout with the same vertex load as a fc-hop one-to-all path layout in the 
same model. 

In the MPLS model, it is also possible to extend the reachability with no extra cost. 
This can be done in the directed rooted tree. 

Claim. Given a directed tree, in the MPLS model it is possible to build a fc-hop all-to- 
all path layout using label-tables of the same size as are used for a fc-hop one-to-all path 
layout in the same model. 

8 Conclusions 

We have given tight bounds on the one-to-all path layout in trees. This leaves several 
open questions. Is it possible to give tight bounds for general graphs? Is it possible to 
give tight bounds for the all-to-all path layout? Note that for the Trainet model this 
paper gives a tight bound for the all-to-all path layout, but the same question for the 
MPLS and ATM models is still open. 

Acknowledgments: We gratefully acknowledge the fruitful discussions with Haim Ka- 
plan and Yehuda Afek from Tel- Aviv University. We also thank Rob van Stee from CWI 
for reading the paper and commenting on it. 
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Abstract. We obtain slightly improved upper bounds on efficient ap- 
proximability of the Maximum Independent Set problem in graphs of 
maximum degree at most B (shortly, B-MaxIS), for small B > 3. The 
degree-three case plays a role of the central problem, as many of the 
results for the other problems use reductions to it. Our careful analysis 
of approximation algorithms of Berman and Fujito for 3-MaxIS shows 
that one can achieve approximation ratio arbitrarily close to 3 — 
Improvements of an approximation ratio below | for this case trans- 
late to improvements below of approximation factors for B-MaxIS 
for all odd B. Consequently, for any odd B > 3, polynomial time al- 
gorithms for B-MaxIS exist with approximation ratios arbitrarily close 
to — 4(5^/^-l8) , This is currently the best upper bound for 

B-MaxIS for any odd B, 3 < B < 613. 

Keywords: maximum independent set, approximation algorithm, 
bounded degree graphs 



1 Introduction 

The problem of finding an independent set of maximum cardinality in a graph, 
the Maximum Independent Set problem (MaxIS), is one of fundamental NP- 
hard combinatorial optimization problems. Recall, that an independent set in a 
graph is a set of vertices that are mutually non-adjacent. The MaxIS problem 
is notoriously known for its approximation hardness. In general graphs the best 
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known polynomial time approximation algorithm achieves an approximation ra- 
tio [3]. The result of Hastad [7] explains this phenomenon by providing 

a strong lower bound of v}~^ for any e > 0, under a reasonable complexity 
theoretic assumption that NP-complete problems do not have randomized poly- 
nomial time algorithms. 

The restricted versions of i?-MAxIS with very small degree bound B (like 
S = 3, 4, 5) have turned up to be important as intermediate steps to many 
other problems, e.g., the Metric Travelling Salesman problem. Currently 
the best lower bounds are || for 3-MaxIS, || for 4-MaxIS, and || for 5- 
MaxIS, obtained in [4]. Namely, it is NP-hard to achieve an approximation 
within the corresponding multiplicative factor for each of problems. On the other 
hand, approximation ratios of algorithms have been continuously improved by 
a number of new techniques and better analysis. The best upper bounds on 
approximability for i?-MAxIS with small degree bound B > 3 are provided 
by algorithms of Berman, Fujito, and Fiirer ([1], [2]) with approximation ratios 
arbitrarily close to The basic ingredient of their heuristics is a type of a 
local search algorithm that additionally searches in the complement of a current 
solution. Such local search algorithm, taking advantage of degree boundedness, 
searches for an improvement of a current feasible solution in far distance. To 
accelerate its performance even further, some general reduction methods and 
tricks are incorporated in the most difficult case, when B = 3. 

Asymptotically, better lower and upper bounds have been obtained. Trevisan 
[8] has proved NP-hardness of approximation for B-MaxIS within 
Halldorsson and Randhakrishnan [6] presented an 0( ipg i^g b )~approximation via 
subgraph removal technique, and then Vishwanathan (firstly documented in [5]) 
has observed that an improved approximation factor of can be 

obtained using semi-definite programming relaxation combining 6;nown combi- 
natorial and algorithmic results on Lovasz theta function and /c-colorable graphs. 

Despite this fact, for small values of B (namely, for 3 < B < 613), the upper 
bound -|- £ hasn’t been improved over the years. 



1.1 Our Contribution 



In this contribution we improve slightly the upper bounds on efficient approx- 
imability for R-MaxIS in case of small degree bound B. We essentially follow the 
approach of Berman and Fujito and make the careful analysis of their algorithms 
for 3 -MaxIS presented in [1] . This results in approximation ratio arbitrarily close 
to 3 — for the problem. 

As a consequence, the approximation threshold for R-MaxIS is also strictly 
less than for any odd B > 3. This follows from the recursive method of 
Berman and Fiirer [2] that an efficient rB_ 2 -approximation algorithm for (B—2)- 
MaxIS can be used to design the one for R-MaxIS with an approximation ratio 



tb arbitrarily close to 1 -I- Consequently, a polynomial time algorithm 

for R-MaxIS (with odd B > 3) exists with an approximation ratio arbitrarily 

close to S+3 - 4(5vT3-18) (B-2)!! 

Close lo 5 5 (B+ 1 )!! ’ 
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This is now the best upper bound for B-MaxIS for any odd B, 3 < B < 613. 
It is worth pointing out that using Nemhauser- Trotter Theorem this also leads to 
better than (2— -g^)-approximation for the Minimum Vertex Cover problem 
in graphs with maximum degree B. 

In [I] Berman and Fujito declare that their algorithms for 3 -MaxIS in 3- 
regular graphs have approximation ratios arbitrarily close to However, we 
show that the arguments from [1] are flawed and this result can’t be taken as 
guaranteed by their proof. 

1.2 Preliminaries and Notation 

For a simple graph G = (V, E), let a{G) denote the size of a maximum indepen- 
dent set in G. For a vertex subset U C V, let G{U) denote the subgraph of G 
induced by U and N{U) the set of neighbors of U, i.e., N{U) = {u G V : G 

U such that (u,v) G E}. 

The local search method is based on augmentation of an independent set U by 
a vertex set called an improvement. A subset / C V is an s-improvement for U, if 
G{I) is a connected graph, the symmetric difference C/ 0 / is an independent set 
larger than U, and |/\C/| < s. A solution is s-optimal if it has no s-improvement. 

The algorithm of Berman and Fiirer ([2]) for building a large independent 
set in low degree graphs G = (V, E) is a local search in two directions. For 
some constant parameter k, a solution A is repeatedly augmented by applying 
improvements of size 0{k log |V |). When such augmentation is no longer possible 
and G was of maximum degree 3, one can And efficiently a maximum independent 
set A! in its complement G(V \ A), because it is a graph of maximum degree at 
most 2 (otherwise A was not 1-optimal). If |A'| > |A|, the local search is started 
again from A! . The resulting solution A is 0(fclog |V|)-optimal and a(G(V \ 
A)) < |A|, which together assure (| + \)\A\ > a{G). Moreover, this inequality 
is tight ([2]). Berman and Fujito in [1] suggested some additional reductions to 
improve the performance ratio for 3 -MaxIS to | 0 e. 

2 Approximation Algorithm for .B-MaxIS 
in Low Degree Graphs 

The algorithm Ak presented in Fig. 1 is Berman and Fujito’s algorithm for 
3 -MaxIS ([!]) with some minor changes. 

The algorithm Ak consists of four phases: (1) preprocessing, (2) a construc- 
tion of initial solution, (3) a local search, and (4) postprocessing. A set S and 
a stack are used in preprocessing and postprocessing phase for the construction 
of a solution for an input graph G. 

(1) In the preprocessing phase Branchy, Nemhauser-Trotter, and Small Com- 
mitment reductions are applied to a graph as long as it is possible to reduce the 
graph. 

Branchy reduction: (i) While there is a vertex v of degree 0 or 1 in G, add v 
to the set S, and remove it (along with its neighbor, if any) from G, (ii) while 
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Input: A graph G = (V,E) of degree 3 

{preprocessing phase} 

repeat 

oldsize := \V\; 
do Branchy reduction; 
do Nemhauser- Trotter reduction; 
do Small Commitment reduction; 
until \V\ — oldsize; 

{construction of initial solution (for reduced graph)} 

Ai-.= {v£V ■. d{v) = 2}; 

if Ai = 0 then := {any 1-optimal set} 

else A2 := {an independent set recursively computed by Ak 
in G(i/\(AiUiV(Ai)))}; 

A ■- A1UA2; 

{local search for improvement} 
repeat 

oldsize := |A|; 

do all possible improvements of size max{3fc log \V\, 4fc + 2 k log |V^|} to A; 
do Acyclic Complement procedure; 

Find an optimal solution A3 in G(V \ A); 
if IA3I > |A| then A := A3; 
until |A| = oldsize; 

{postprocessing phase} 

Reconstruction of a solution for an input graph G 

Fig. 1. The algorithm Ak 

there is a path P = [vl,V2^ V3, V4] with both vertices V2 and V3 of degree 2, push 
P into the stack, remove V2 and V3 from G and, if {v\,V4) is not an edge in G, 
add it to G. 

Small Commitment reduction: If for some vertex subset U C V oi size at 
most fc, C/ is a maximum independent set in G{U U N{U)), add U to S, and 
remove U U N{U) from G. 

Nemhauser-Trotter reduction: This reduction in polynomial time partitions 
a set V into subsets V\, V2, V3 such that (i) there is a maximum independent 
set containing all the vertices of Vi but none of V2, (ii) there is no edge between 
Vi and V3, and (iii) a{Gv3) < ^Ihal. If V3 yf V, then add Vi to S and remove Vi 
and V2 from G. 

Definition 1. Let V{k) denote the set of graphs of maximum degree at most 3 
that are not reduced by the preprocessing phase of the algorithm Ak- 

Obviously, each graph G from V{k) has the following properties: 

(PI) each vertex is of degree 2 or 3, 

(P2) any two vertices of degree 2 are nonadjacent, 

(P3) a{G) < \\V\, and 

(P4) every vertex set U of size at most k is not a maximum independent set in 
G{UUN{U)). 
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( 2 ) An initial solution for a local search is constructed with preference given to 
vertices of degree 2. Let Ai be the set of all degree 2 vertices in the graph obtained 
by preprocessing. This is an independent set due to (P2). If Ai is empty, then we 
take any 1-optimal independent set as an initial solution A. Otherwise, we first 
obtain an independent set A 2 applying recursively Ak to GiV \ {Ai U fV(Ai))), 
and take A := Ai U A 2 as an initial solution we start with. 

(3) The algorithm Ak makes improvements of A in the following steps: 

(i) it makes A to be 0(fclog |P|)-optimal; 

(ii) without decreasing its size, A is improved by Acyclic Complement proce- 
dure to be an independent set with acyclic complement; 

(iii) the MaxIS problem is optimally solved for a graph G{V \ A) of degree at 
most 2, and if the solution is larger, A is updated. 

This is repeated in a cycle until no more improvements are achieved after steps 
(i)-(iii). We start with a local improvement (compare with [1]) to ensure that 
Acyclic Complement procedure always receives a 2-optimal solution as its input . 

Acyclic Complement procedure: If / is a 2-optimal independent set in a con- 
nected graph G = (V, E) of maximum degree 3 (and G is not a complete graph 
on 4 vertices) this procedure finds in polynomial time an independent set I' of 
size |/'| > |/| such that G{V \ I') is an acyclic subgraph (see [1] for details). 

(4) In the postprocessing phase the solution for an input graph G is constructed 
from the solution A found in the phase (3) for its reduced graph (obtained from 
G by preprocessing phase). The whole set S produced by Branchy reduction (i), 
Nemhauser-Trotter reduction, and Small Commitment reduction can be added 
to the solution A. For any path [ui, U 2 , U 3 , U 4 ] pushed into the stack by Branchy 
reduction (ii), we can add either V 2 or V 3 to A and keep it independent. 

Remark 1. It can be seen that the effect of all reductions from the preprocessing 
phase is no loss in approximation quality. Hence, to prove that Ak approximates 
3-MaxIS within 1 -|- <5, it is enough to do so for graphs from V{k). 

To find the upper bound on the approximation ratio of the algorithm Ak , we 
consider a graph G = (V, E) from V{k) as an input and let A' be the approximate 
solution provided by Ak- We try to find relations between its size and the size 
of an optimal solution. By B' denote one fixed optimal solution in G such that 
|A' n H'l is of the maximum size, and among all sets with this property B' 
contains maximum number of degree 2 vertices. Partition V into four sets, A 
(“approximative”), B (“best”), C (“common”), and D, such that C := A' C\ B' , 
A := A' \ G, B := B' \ G , D := V \ {A U B U C). For the reader’s convenience, 
we use the same notations as in [ 1 ]. 

Based on this partition of V , every vertex in D will be further classified 
according to its neighboring subsets. A degree 3 vertex is of type [XYZ] if 
its neighbors belong to vertex sets X, Y, Z, where each set X, Y, and Z, is 
one of sets A, B, G, D. In the same way we define for a degree 2 vertex its 
type [ATF]. Due to our assumptions that B' has maximum possible intersection 
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with A! ^ and additionally it has maximum number of degree 2 vertices (rela- 
tively to the first condition), there are only 4 types for degree 2 vertices from 
D {[AC], [BC], [CC], [CD]) and 13 types for degree 3 vertices from D {[AAB], 
[A AC], [ABB], [ABC], [ABD], [ACC], [ACD], [BBC], [BCC], [BCD], [CCC], 
[CCD], [CDD]). Denote the cardinalities of sets A, B, C, and D, by a, b, c, and 
d, respectively, and the number of degree 2 vertices in sets A, B, and C, by U2, 
62, and C2, respectively. The number of D- vertices of type [XYZ] (resp., [-’fT]) 
is denoted by its lower case counterpart, [xyz] (resp., [xy]). 

The purpose of the preprocessing and the acyclic complement procedures is 
to normalize a given graph and temporary solutions so that the final approximate 
solution has properties which allow us to compare its size with the size of an 
optimal solution. The proof of the following relations (l)-(7) among sizes of 
vertex subsets of the partition defined above can be found in [1]. 

— i — c > 3 [6c] + 2 [66c] + [bcd\ + [cdd\ + [cd\ + [6cc] -I- [cc] 

fx 

— (2[acc] + [aac] + [a6c] -I- [acd] + [ac]) (1) 

— i — c > [66c] + [be] + [6cc] + [cc] + —{[bed] + [edd] + [cd] + [cedX) (2) 
k 2 

c > (6 — a) -b [ccc] + [acc] + [aac] + [ac] + [cc] 

+ —{[edd] + [ced] + [acd] + [cd]) (3) 

k I 1 

— - — a > 3(6 — a) + a2 + 2[aa6] -b 2[aac] -b [a66] -b [abd] 

fx 

+ [acc] + [acd] + [abc] + [ac] (4) 

d > c -b (6 — a) (5) 

62 = 3(6 — a) -b 02 -b ([006] -b 2 [aac] -b [occ] -b [acd] + [oc]) 

— ([a66] -b 2 [66c] -b [6cc] -b [6cd] -b [6c]) (6) 

3c — C2 = 3 [ccc] -b 2 ([acc] -b [6cc] -b [ced] -b [cc]) -b [00c] -b [a6c] -b [66c] 

-b [acd] + [bed] + [edd] + [oc] -b [6c] -b [cd] (7) 

In our proof that Ak has approximation ratio < (1 -b 6), for a fixed constant 
S, we want argue by contradiction. Assume that for some graphs Ak does not 
perform so well, then for minimal counterexamples we can prove some addi- 
tional relation. This is given in the following Lemma which corresponds to the 
inequality (8) from [1] (choosing 6 = |). 

Lemma 1. Let k be a fixed positive integer and S € (0, 1) be such that the 
approximation ratio of Ak is > 1 + 5 . If C is a graph with minimum number 
of vertices for which Ak returns an independent set of size < y^o;(G'), then C 
belongs to V{k) and 

+ [6c] -b [cc] > ~{b2 -b C2 -b [oc] -b [cd]). 



(81-1-5) 
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Proof. As follows from Remark 1, G belongs to V{k). Hence G is not changed 
by preprocessing steps of Ak and the algorithm effectively starts with the con- 
struction of an initial solution. If G is 3-regular, then the inequality trivially 
holds with both sides equal to 0. Hence, we can assume in what follows that the 
set Ai of degree 2 vertices of G is nonempty. Ak takes as initial solution the set 
A := Ai U A 2 , where A 2 is the solution that Ak finds recursively for the graph 
G{V \ (Hi U N{Ai))) as an input. 

From the set Hi, opt^ vertices belong to B LI C and opt^ vertices have exactly 
i neighbors in H U C, i = 1, 2. Removing the set Hi U fV(Hi) from G, we can lose 
at most optQ+opti+ 2 opt 2 vertices from BL G and hence G(V \ (Hi U A^(Hi))) 
contains an independent set of size at least {b + c) — {opt^ + opt^ + 2 opt 2 ). The 
assumption of minimality of G implies that Ak finds an independent set H2 of 
size at least ii|^(6 + c — opt^ — opt^ — 2 opt 2 ) in this smaller graph, and hence 

|H| > {optQ + opt.y + opt 2 ) + + c — optQ — opti — 2 opt 2 ) 

= + c) + ~ ■ 

On the other hand, |H| < j^ce{G) by our assumptions. Combining the inequali- 
ties we get opt 2 > pzy(opfo + f^at can be easily translated as (81+5), even 

with the strict inequality in this case.D 

Theorem 1. The algorithm Ak approximates the 3-MaxIS problem within 3 — 
time where n is the number of vertices of an input 

graph. 

Proof. Denote <5* := 2 — e := , and 5 := i5* -be. We will prove that an 

approximation ratio of Ak is at most 1 -I- (5 via contradiction. Let G be a graph 
with minimum number of vertices for which Ak returns an independent set of 
size less than j^a{G). 

By Lemma 1, G belongs to V{k) and (81+5) holds together with (l)-(7). 
Summing up inequalities and equations (l)-(7), (81+5) with respective multi- 
plicative factors of 1 -b 26, 2(1 — 26), 4(1 — 26), 5 — 26, 1 — 26, 4(5, — 1 -b 26, 4(1 — 5) 
yields the inequality 

+ c)(3 — 25) > (14 — 45)(6 — a) -b (65 — l)(a2 + C2 -b [6c]) -b (2 -b 45)[cd] 

rZ 

-b (5 -b 25)[aa6] -b (7 — 25)[aac] -b (2 — 45)([a66] -b [acc] + [66c]) 

-b 2 {[abd] + [acd] + [cdd]). 

As clearly 5 G [|, i], all the terms in brackets are of nonnegative value. Thus the 
inequality holds with the right hand side simplified to (14 — 45) (6 — a), and we 
rewrite it as 1 4^-46 ■ other hand, our assumption a -b c < 

implies 5 < Combining these inequalities we get 5 < 14 - 4 ^^ j which 

we rewrite as k < Substituting 5 := 5* -b e, we get equivalently k < 
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^ 4 ~ ’ where the right hand side is clearly less than 

a contradiction. This contradiction shows that the approximation factor of the 
algorithm Ak is at most 1 + (5, as required. 

The time complexity of Ak is dominated by that of its local neighborhood 
search. A search space consists of all connected subgraphs of vertex set size 
at most max{3fclogn,4fc + 2/clogn}. One can show that the number of such 
subgraphs of size at most s in a degree 3 graph with n vertices is smaller that 
n3®. Hence, it takes time to find a small improvement. As a solution 

is augmented at most n times, Ak runs in time □ 

In [2] Berman and Fiirer have shown how one can use an algorithm with 
the approximation ratio tb -2 for {B — 2)-MaxIS to obtain an algorithm for 
H-MaxIS with the approximation ratio tb (for a fixed constant k) such that 



re < 1 + 



1 , B-2 

k{B + l) ^ B + l 



TB-2- 



For the approximation threshold is on polynomial time approximability of B- 
MaxIS (defined as Ib = inf{r > 1 : there is a polynomial time algorithm 
that approximates H-MaxIS within r}) this implies the recurrent relation ts < 



1 + 



B-2 j 



B+i^B-2- 

As follows from the results of [2] and [1], Ib < for any B > 2. Our 
Theorem 1 shows that < 3 — which in turn implies better upper bound 
on tB than for any odd i? > 3. If we rewrite ts in the form tB = —Sb, 
then we have that £3 > and the above recurrent relation reads as 

£b > 5 ^£b- 2- Therefore, for any odd B >3, 



B-2 B-4 5 3 (H-2)!! 4(5^13 - 18) (H - 2)!! 

- B+l ' B-1 " ' 8 ' 6 ' “ ^(H+ 1)!!^^ “ 5 (H + 1)!!' 



Finally, te < (b+i)!! H > 3. To our knowledge, 

this is currently the best upper bound on polynomial time approximability of 
H-MaxIS for any odd B, 3 < B < 613. See the paper by Halldorsson and 
Randhakrishnan [6] for comparison of results and for the range of B, when 
methods used to prove asymptotic result Ib = o{B) achieve upper bounds below 

B+3 

5 ■ 

Hence, we can summarize the results as follows 



Corollary 1. The H-MaxIS problem can he approximated in polynomial time 
with an approximation ratio arbitrarily close to , for 

every odd B >3. 

Remark 2. Berman and Fujito in [1] claimed that their algorithm Ak achieve for 
3-regular graphs approximation factors arbitrarily close to | (depending on k). 
However, they proved such approximation ratio of Ak only for 3-regular graphs 
belonging to V{k). In the following we observe that an input 3-regular graph 
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can be reduced by the preprocessing phase of Ak to the one with vertices of 
degree 2 as well. Hence, the fact that 3-regular graphs from V{k) have better 
approximation ratio cannot be simply extended to all 3-regular graphs. 

Let us consider a gadget H with 6 vertices a, b, c, a', b', and c', and 8 edges 
defined as follows: the triple a, b, c, and similarly a' , b' , c' , is a triangle in H 
and the triangles are connected by edges (a, 6') and (a',b). If an input 3-regular 
graph G contains a copy of H as an induced subgraph, then Small Commitment 
reduction (with k > 2) can commit itself to U := {a, a'} (respectively to U := 
{6, b'}) and remove all vertices H of the gadget from G. If there are many copies 
of H (or similar gadgets) in G, then the graph G' obtained from G by the 
preprocessing phase can have many vertices of degree 2. 

However, we can obtain slightly better estimate r G (|>3 — on the 

approximation ratio of Ak for 3-regular graphs, than for general 3 -MaxIS. We 
do not focus on estimating the precise value of r in this paper and provide just 
a sketch how such result can be obtained. 

Let G be a 3-regular graph and G' G ^{k) be the graph obtained from G 
after preprocessing phase of Ak- 

(1) If a relatively small fraction of vertices of G' is of degree 2, we can apply 
arguments similar to those of Berman and Fujito [1] in the proof of roughly | 
approximation factor for 3-regular graphs in V (k) . The method is robust enough 
to show that in case of “few” degree 2 vertices in G', Ak performs within r on 
G', and hence on G as well. 

(2) On the other hand, if the fraction of degree 2 vertices of G' is above 
some fixed level, then one can prove that the number of stored vertices in S is 
significant, and a{G) — a{G') is at least a fixed fraction of a(G'). It implies, that 
even if Ak{G') is guaranteed to be only roughly within 3 — of a(G'), the 
postprocessing phase of Ak will improve this approximation ratio for G to r. 
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Abstract. We study asynchronous packet radio networks in which trans- 
missions among nodes may be delayed. We consider the task of broad- 
casting a message generated by the source node. The timing of arrivals of 
messages is controlled by adversaries. We consider three different adver- 
saries. The edge adversary can have a transmitted message delivered at 
different times to different recipients. The crash adversary is the edge one 
augmented by the ability to crash nodes. The node adversary can have a 
message received at arbitrary times, but simultaneously by all the recip- 
ients. A protocol specifies for each node how many times the message is 
retransmitted by this node, after it has been received. The total number 
of transmissions of nodes is defined to be a measure of performance of a 
broadcast protocol, and is called its work. The radio network is modeled 
as a graph and is given as input to a centralized algorithm. An aim of 
the algorithm could be either to find a broadcast protocol, possibly with 
additional properties, or to verify correctness of a given protocol. We 
give an algorithm to find a protocol correct against the edge adversary. 
The obtained protocol is work-exponential in general. This is an inherent 
property of the problem, as is justified by a lower bound. We develop a 
polynomial algorithm to verify correctness of a given protocol for a given 
network against the edge adversary. We show that a problem to decide if 
there exists a protocol, for a given network and of a specified work per- 
formance, is NP-hard. We extend some of these results to the remaining 
two adversaries. 



1 Introduction 

In this paper we consider broadcast protocols in packet radio networks. One 
node is designated as a source and all the other nodes are reachable from the 
source along directed paths. The source is an origin of a message that needs to 
be disseminated among all the other nodes in the network. This is achieved by 
relaying the message from node to node. 

Radio networks studied in the literature are typically synchronous, with local 
clocks ticking at the same rate and producing the same round numbers. It is 
also usually assumed that a transmission by a node occurs within one round and 
occupies the whole round. 

The standard mode of radio communication has the following properties: 
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(i) if a message is sent by a node, then it is simultaneously sent to all the 
out-neighbors; 

(ii) a message sent at a round is delivered to the recipients in the same round, 
but is not necessarily heard, that is received successfully, by all of them; 

(iii) a message is heard by a node when it is the only message arriving at the 
node in a round. 

There are also the following two tacitly made assumptions. First is that the 
nodes given in a network specification are the only ones participating in commu- 
nication protocols. Second is that communication protocols, when actually run, 
can control the behavior of the nodes in the sense that a protocol is specified as 
a schedule that determines for each step whether a transmission occurs or not 
in the step, the nodes follow the protocol precisely as stipulated. In this paper 
we introduce models of asynchronous radio communication. Our approach is by 
way of relaxing these two tacit assumptions, and possibly some of the ones listed 
as (i-iii) above. 

Previous Work. Broadcasting problems have been considered in various com- 
putational settings, including centralized [1, 5-7, 15], distributed deterministic [4, 
8, 9, 11, 13, 14, 21-23], and distributed randomized [2, 14, 23, 25]. A lower bound 
l7(log^n) for broadcasting was given by Alon, Bar-Noy, Linial and Peleg [1]; 
it holds even for networks with a constant-bounded diameter and centralized 
protocols with complete knowledge. A lower bound f2{Dlogn) was shown by 
Bruschi and Del Pinto [4]. It was later improved to f2{nlogD) by dementi, 
Monti and Silvestri [13]. Fault-tolerant broadcasting in radio networks was stud- 
ied by Kushilevitz and Mansour [26] and by Kranakis, Krizanc and Pelc [24]. 

Gossiping is a natural extension of broadcasting, in which each node is a 
source of an individual message, and a goal is to have all the nodes learn all 
the messages, see [11,27,28]. Other problems of multiple communication were 
studied by dementi, Monti and Silvestri [12] and by Bar- Yehuda, Israeli, and 
Itai [3]. Radio protocols that are efficient in terms of energy use were given for 
the wakeup-like problem by Zheng, Hou and Sha [30], and for the leader-election 
problem by Jurdzihski, Kutylowski and Zatopiahski [19]. 

Broadcasting in radio networks is a special case of the wakeup problem. 
In that problem, initially only a source node is awoken, while the remaining 
nodes join in disseminating the source message after having received a wakeup 
message from the source. The wakeup problem for radio networks was first con- 
sidered by G^sieniec, Pelc and Peleg [17], in the case of single-hop networks, 
who developed deterministic and randomized protocols. A general version was 
considered by Ghrobak, G^sieniec and Kowalski [10]. The best known determin- 
istic wakeup algorithm for general ad-hoc networks was given in [10] and is of 
time logn) performance. The paper [10] also showed how to synchronize 

local clocks and how to solve leader election in radio networks, relying on a solu- 
tion to the wakeup problem, with only a logarithmic-time overhead. Jurdzihski 
and Stachowiak [20] studied randomized wakeup protocols for single-hop radio 
networks. Indyk [18] considered constructive wakeup protocols. 
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2 Technical Preliminaries 

A radio network is modeled as a directed graph G = (V, A), where R is a set of 
nodes, and A is a set of directed edges. An edge {x,y) G A is also denoted as 
a; — > y. When a: ^ y is an edge, then node x is its start point, and node y is its 
end point, while x is an in-neighbor of y and y an out-neighhor of x. The set of 
all the in-neighbors of y is denoted as iN(y), and the set of all the out-neighbors 
of X is denoted as OUT(a;). We assume that a network is represented by each 
vertex having a list of pointers to its in- and out-neighbors. 

For a node x and each outgoing edge x ^ y, there is a outgoing buffer 
ontx{y) at x associated with the edge. When the node x sends a message, then 
the message is put into each such a buffer ont^iz) at x, for any outgoing edge 
X ^ z. For a node y and an incoming edge x ^ y, there is an incoming buffer 
iny(a;) at y associated with the edge. The message in an outgoing buffer outa,(y) 
is eventually delivered from the sender x to the recipient y. This is achieved by 
removing the message from the outgoing buffer outa;(y) and putting it into the 
incoming buffer in^ (x) . At most one message can traverse an edge in a step from 
an out-going to the in-coming buffer. We distinguish a message being delivered 
from it being heard, that is successfully received. A message delivered at node 
a; at a round is heard at x if it is the only message delivered to x at this round. 
If more than one messages arrive at a node at a round, then none is received 
successfully. An interpretation is that all the messages are transmitted on radio 
waves of the same frequency and so many arriving simultaneously interfere with 
each other. A message delivered to a node y is removed from the incoming buffer 
in the same round. 

A broadcast protocol B is determined by a function : R ^ N that assigns 
a non-negative integer B{v) to each node v € V. Such a function is called a 
repeat-broadcast function and B{v) is a repeat-broadcast value of v. The protocol 
B determined by such a function operates as follows: once a node v receives a 
message, then it sends the message B{v) times. 

Timings of deliveries of messages are controlled by an adversary, once they 
have been put into the outgoing buffer. The adversary aims to disrupt the flow of 
copies of the broadcast message through the network. The only general constraint 
imposed on adversaries is that, during each step, if some buffer is nonempty, 
then at least one copy of the message is transmitted in this step. Transmitting 
a message is done by moving it from the outgoing buffer to the in-coming one. 
We consider three types of adversaries, each has to act subject to the general 
constraint. 

Edge adversary: It acts subject only to the general constraint. 

Crash adversary: It is the edge adversary augmented with the power to fail 
nodes by crashing. The adversary chooses the vertices to crash prior to the 
start of an execution. The faulty nodes do not participate in executions, and 
may be considered as removed from the graph. This may make some nodes 
unreachable; such nodes are also considered to be crashed. 
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Node adversary: It acts subject to an additional constraint: if a message is 
received at node y from some other node x, then at the same step the message 
is received at node z for any edge x ^ z. 

The work W{B) of protocol B is defined to be equal to the sum of the numbers 
of all the broadcasts at the nodes of G, that is W{B) = We can 

observe that the time after which the broadcast is completed is at most W{B) = 
since the adversary has to transmit at least one message in each 
step. Protocol B for a given graph G is correct with respect to adversary A if 
the message is heard eventually by each node of G, when the repeat-broadcast 
function of B is applied, and while it is A who decides on the timing of arrivals 
of each message broadcast. 

3 Protocol by Enforcing Dominance 

A node x is called a relay for node y if a; ^ y is an edge and the distance from 
the source to x is smaller than to y. Let S' C P be a set of nodes. A node x G S 
is said to dominate S if the inequality 

B{x)> Y. 

y&S\{x} 

holds, while the empty sum is assumed to be equal to zero. Observe that each 
node different from the source has at least one relay node, because all the nodes 
are reachable. If there is an edge x y and node x dominates all the in-neighbors 
of y, then x is said to guard node y, which is also denoted x = L{y). A node x 
is guarded if one of its neighbors guards it. 

Lemma 1. If, for each node y different from the source, there is a relay node 
that guards y, then the protocol is correct with respect to the edge adversary. 

Next we describe an algorithm Distance-Domination to find a protocol 
correct against the edge adversary. The algorithm assigns values of repeat- 
broadcast function to the nodes. In general, the work of the obtained protocol 
is exponential in the size of the network. 

Let us fix an enumeration (Ti)i<i<„ of all the nodes so that if the distance 
from the source s to Vk is smaller than the distance from the source to vi, 
then I < k. In particular we have that s = Vn- Initialize the broadcast function 
B{v) = 0, for each node v G V .It is next modified for each node, in the increasing 
order of indices. For a current setting of the repeat-broadcast function, define 
Cj = 1 + X)a;6iN(i> ) B{x). The repeat-broadcast function is set to its final value 
by considering all the nodes in increasing order of their indices. Take an index i, 
such that 1 < i < n and all 1 < A: < z have been processed. Set B{vi) equal to 
the maximum value among Cj over all edges Vi —>■ Vj, if at least one such an 
edge exists, and B{vi) = 0 otherwise. 

Theorem 1. Algorithm Distance-Domination produces a protocol that is cor- 
rect against the edge adversary. Its work is always less than 2”“^, where \ V\ = n. 
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The work-performance of a protocol produced by the algorithm Distance- 
Domination is exponential, which cannot be improved in general, as we show in 
Section 4. One could also observe that ordering the nodes on the distance to them 
from the source in increasing order of indices, and then assigning 2* to a node Vi 
as a value of a repeat-broadcast function, results in a correct protocol. This is 
because each set of nodes has a dominating element, hence Lemma 1 applies. 
If this is so simple, why to develop the algorithm Distance-Domination? An 
answer is that for some classes of graphs the work of a protocol obtained by this 
algorithm could be o(2”) for networks of n nodes. 

Theorem 2. The work of a protocol obtained by the algorithm Distance-Do- 
mination, on networks with in-degrees at most d, is where 1 < C < 

2(1 -2-<^). 

Proof. The numbers B{vm) satisfy an inequality < Y^T=m-d for 

m > d. Hence an inequality B(vk) < Ak holds, where the numbers are 

defined by the recurrence relation A^+d+i = ^fc+i + A ^+2 -I- ... -I- Ak+d and 
suitable initial values. This is a definition of d-step Fibonacci numbers. The 
characteristic equation of this recurrence is x'^ — x'^~^ — x'^~‘^ — ... — a; — 1. It 
has one positive simple root C in the interval [1, 2], which satisfies the inequality 
1 < C < 2(1 -2-<^), see [29]. □ 

4 Lower Bound 

In this section we show that there are networks requiring exponential work from 
broadcast protocols. More precisely, we construct a family of networks G„, each 
with n nodes, such that for any broadcast protocol on G„ correct against the 

1/3 

edge adversary, its work has to be at least c" , for some constant c > 1. 

We define a family of networks for each integer k > 0. Network Fk has 
(g) -I- /c -I- 1 nodes, partitioned into three layers. The top layer Lq consists of 
only the source node. The middle layer Li consists of some k nodes, and the 
bottom layer L 2 that includes the remaining (3) nodes. Directed edges connect 
the source with each element of layer Li. There is a one-to-one correspondence 
between the three-element subsets of Li and nodes of L 2 : for each such a subset 
X there is exactly one node v in the layer L 2 that X consists of the in-neighbors 
of V. 

Lemma 2. Let B be a broadcast protocol for F^. correct against the edge ad- 
versary. Then the following inequalities hold: B{vi) > 0 and B{vi) > fi- 2 , for 
k > i > 2, where fi is the ith Fibonacci number. 

Theorem 3. There is a sequence (G„)„>i of networks, such that G„ has n 
nodes and any broadcast protocol for this network, that is correct against the 
edge adversary, requires work where ^ = (1 -I- Vb)/2 is the golden 

ratio. 
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Proof. Take k = and consider F^. The network Fk is of a size 0{n). By 

Lemma 2, and properties of Fibonacci numbers, the work of the algorithm is at 
least ^ /*-2 = /fc ~ 1- Since fk is equal to (f>^ j\fh rounded to 

a closest integer, the work of the algorithm is □ 

5 Verifying Correctness 

Given a network G with a source s, and a repeat-broadcast function B, we 
consider a problem to verify if the broadcast protocol determined by B is correct 
against the edge adversary. 

For two nodes u and v, we say that node u depends on v if each path from the 
source s to m traverses either u or a node w with B{w) = 0. If an edge u ^ v has 
a property that u depends on v, then such an edge is said to be redundant. An 
edge that is not redundant is called significant. If edge u ^ v is redundant, then 
transmissions by u cannot block v from receiving a message, since node v receives 
the message before u. Redundant edges can be removed without affecting the 
set of nodes that receive the source message, for any behavior of the adversary. 

We may start with removing all the redundant edges. To identify redundant 
edges, do the following. For each vertex v, first remove v and all the nodes w 
with B{w) = 0, and then identify all the nodes than cannot be reached from the 
source by a directed path. If node u is among them and u ^ u is an edge of G 
then this edge is redundant. In the remaining part of this section we assume that 
any edge is significant. 

Lemma 3. If node v does not have a guard, then the adversary can block v from 
receiving the message. 

Lemma 3 implies that every vertex has to have a guard for the protocol to 
be correct. This is a necessary but not a sufficient condition however. At least 
one transmission by L(v) cannot be blocked by one of a different neighbor of v, 
because L(v) dominates them all. This shows: 

Lemma 4. If there is a guard of vertex v, and the guard receives the message, 
then v also will receive the message eventually. 

Now we develop a routine to verify if a vertex v can be blocked from receiving 
a message. A pseudocode for this routine is in Figure 1, where L(v) denotes a 
function that returns a guard of vertex v. Notice that guards may change as the 
repeat-broadcast function evolves. 

The routine maintains a set SBV, which stands for set-of-blocked- vertices, of 
vertices that cannot receive the message to block vertex v. Initialize SBV = {u} 
and modify B so that B{v) = Q since v will never broadcast as a blocked vertex. 
Remove all redundant edges for such a modified B. Lemma 4 implies that Lfv) 
cannot receive a message to block v. Add L{v) to SBV and modify B so that 
B{L{v)) = 0. Now remove all redundant edges for such a modified B. Next, for 
each vertex x in SBV, find its guard L{x) and add the guard to the SBV while 
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procedure Check-Vertex-Blocked ( v ) 
if V = s then return false 
SBV := {u} ; B{v) := 0 
remove all the redundant edges 
repeat 

NewVerticesToBlock := 0 
for each vertex u £ SBV 
if L{u) exists then 

NewVerticesToBlock := NewVerticesToBlock U {L(u)} 

B{L(u)) ■- 0 

remove all redundant edges 
SBV SBV U NewVerticesToBlock 
until NewVerticesToBlock = 0 
if s e SBV then return true else return false 



Fig. 1. A procedure to verify if node v can be blocked by the edge adversary. 



modifying B to be equal to zero on L{x). Keep iterating this, until at some point 
the set SBV stops increasing, which occurs when there are no more guards to be 
blocked. If the source s is in SBV, then v cannot be blocked, otherwise it can. 

Lemma 5. The procedure Check- Vertex-Blocked is correct. 

An algorithm Verify-Correct(G) operates as follows: for each node v G G 
verify if v is blockable by calling Check- Vertex-Blocked(u). 

Theorem 4. Given a network G of n nodes, and a repeat-broadcast function B, 
the algorithm Verify-Correct checks if the protocol B is correct against the 
edge adversary in time 0(n®). 

Proof. Each node v is added to the set SBV at most once, since after v has been 
put in SBV, its repeat-broadcast value is set to zero and v is never a guard. 
The until loop can be executed up to n times. The inner for loop takes |SBV| 
iterations. Each iteration consist of finding L{u), which takes time 0(|in(u)|) 
per call, followed by removing the redundant edges in time 0{n^). The runtime 

of Check- Vertex-Blocked is 0((1 -1-2-1 h n)n^) = 0{n^), which yields a 

total runtime 0(n®). □ 

6 Hard Problems for Edge Adversary 

We present two decision problems regarding broadcasting in asynchronous net- 
works, with timing of arrivals of messages controlled by the edge adversary, that 
are NP-complete. The first problem is about existence of a broadcast protocol 
with work bounded locally. 
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Problem: 0-1 Asynchronous Radio Broadcast 
Instance: A network G with a distinguished source. 

Question: Is there a broadcast protocol for G correct against the edge 
adversary and such that each vertex broadcasts at most once? 

Theorem 5. The problem 0-1 Asynchronous Radio Broadcast is NP- 
complete. 

The next problem is about existence of a broadcast protocol with work 
bounded globally. 

Problem: Work-Bounded Asynchronous Radio Broadcast 
Instance: A network G with a distinguished source, and a positive 
ger W. 

Question: Is there a broadcast protocol for G correct against the 
adversary and such that its work is at most W? 

Theorem 6. The problem Work-Bounded Asynchronous Radio Broad- 
cast is NP-complete. 

7 Crash Adversary 

The crash adversary has all the power of the edge adversary, but additionally 
can crash nodes. We assume that the source s is never failed. For a broadcast 
protocol B, the adversary chooses nodes to be crashed prior to the start of an 
execution. The protocol B is correct if the message reaches all non-faulty nodes 
that are reachable from the source by a directed path of non-faulty nodes. 

Verification of Correctness of a Broadcast Protocol. We would like to 
verify if a protocol B is correct for the given input network G. Node v is said to 
be blockable for B if the crash adversary can crash some nodes, possibly none, 
so that there is path P = (s, ui, . . . , Uk, v) from s to w consisting of non-faulty 
nodes, and node v never receives the message in some execution of B. 

Lemma 6. If there exists a node v that is blockable by the crash adversary, then 
there exist a node u that is blockable by crashing only some neighbors of u. 

Next we consider an algorithm Verify- Against-Crashes to verify correct- 
ness of a protocol, by way of checking if there exists a blockable node. For each 
node V check if there is a subset S C in(i;) such that S does not contain a guard. 
If for some node v such a subset S is found, then this v is blockable, and the 
protocol is not correct, otherwise B is correct. The algorithm is in Figure 2. 

Theorem 7. The algorithm Verify- Against-CrasheS(1BJ checks if B is cor- 
rect in time 0{n^). 

Proof. Correctness follows from Lemma 6. The outer for loop takes n iterations. 
The inner while loop takes at most n iterations. Removing redundant edges costs 
0(n^). The total runtime is thus 0{n^). □ 



inte- 

edge 
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algorithm Verify- Against-Crashes ( B ) 
remove all the redundant edges 
for each node u do 
C--B 

while L(v) exists for C do 
C(L(u)) :=0 

remove redundant edges for C 
if in(u) is nonempty then 

return “u is blockable” 
return “the protocol is correct” 



Fig. 2. Algorithm to verify correctness of B against the crash adversary. 



The algorithm Verify- Against-Crashes checks if there exist a blockable 
node, which can be done in polynomial time. Interestingly enough, the problem 
to check if a specific node is blockable is NP-complete. 

Optimal Work against the Crash Adversary. We show that work 17(2”) 
is sometimes necessary to have for a protocol that is correct against the crash 
adversary. This amount of work is then optimal, since it is always achievable, as 
observed in Section 3. 

Lemma 7. Let B he a broadcast protocol. If any subset of in{v) contains a guard, 
for each node v, then the protocol B is correct against the crash adversary. 

We say that vertex v is vulnerable if a removal of an arbitrary subset S C 
in(u) of nodes of v and also of the node v does not disconnect the nodes in in(u)\S' 
from the source s. Intuitively, a node v is vulnerable if in a situation when an 
arbitrary proper subset of neighbors of v have been crashed, the remaining in- 
neighbors of v can receive the message before v receives it. 

Lemma 8. Let v G G he a vulnerable node. If protocol B is correct for G, then 
each subset of in ( v) contains a guard. 

Lemma 9. Let V = {uq, . . . ,u„_i} and if :V If if has the property that 
for each subset S C V there exists an element v G S such that v dominates S, 
then the inequality T{vi) > 2* holds. 

Next we define a network H. It consists of n nodes partitioned into three 
layers. The top layer Lq consists of one source vertex s. The middle layer L\ 
consists of n — 2 nodes. The bottom layer L 2 consists of one vertex v. The vertex 
s is the in-neighbor of each vertex from layer Li and each vertex from Li is the 
in neighbor of the vertex v. We can see that vertex v is vulnerable. 

Theorem 8. There exists a network G of n nodes for which any protocol B 
correct against the crash adversary has work 17(2”). 
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Proof. Consider the network H . Vertex v is vulnerable, so by Lemma 8 there 
exists a guard in each subset of Li. By Lemma 9, the smallest possible amount of 
work is achieved when vertices in Li transmit 2°, 2^, , 2”“^ times respectively. 
Hence the work is 17(2”). □ 

NP Complete Problem for Crash Adversary. Next we describe a decision 
problem regarding broadcast protocols correct against the crash adversary that 
is NP-complete. 

Problem: Asynchronous Radio 3-Broadcast with Crashes 
Instance: A network G. 

Question: Is there a repeat-broadcast function for G that assigns at most 
three transmissions to each node so that the resulting broadcast protocol 
is correct against the crash adversary? 

Theorem 9. The problem Asynchronous Radio 3-Broadcast with 
Crashes is NP-complete. 



Broadcast Protocols and Colorings. We show that a vertex coloring yields 
a broadcast protocol. Let F be the graph and x(F) its chromatic number. We 
assume that colors used to color F are the numbers {0, . . . , x{P) ~ !}• The graph 
is colored so that the color 0 represents to most frequently used color, color 1 
represents the second most frequently used color, and so on, with color x(J^) — 1 
representing a least frequently used color. 

Given network G, build graph F{G) as follows. The nodes are those of G. 
For each pair of edges u ^ w and v ^ w of G, create edge (u, v) in F{G). 

Theorem 10. Ifx{F{G)) is a chromatic number of the graph F{G), then there 
is a broadcast protocol B for network G with work /x{F{G))). 

Proof. The coloring guarantees that, for each u € G, all the in-neighbors of u 
have different colors. For each node v, set B{v) to be equal to Such an 

assignment guarantees that each node will have a guard node in each subset of 
its in-neighbors, so it is correct by Lemma 7. The total work of the protocol is 
at most /x{F{G))), since the maximal size of a clique in F{G) is 

xiF{G)). □ 

8 Node Adversary 

The node adversary is weaker than the edge one, in the sense that each algorithm 
correct against the edge adversary is also correct against the node one. 

Theorem 11. There is a sequence (G„)n>i of networks, each of n nodes and 
any broadcast protocol for this network, that is correct against the node adversary, 
requires work I7(c'/"), for a constant c > 1. 
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Consider a counterpart of the problem Work-Bounded Asynchronous 
Radio Broadcast, which was shown to be NP-hard in Section 6. The problem 
is as follows: Given a network and an integer w > 0: is there a protocol for this 
network that is correct against the node adversary and of work at most w7 We 
conjecture that this problem does not even belong to NP. If true, this would 
imply that there is no deterministic polynomial algorithm to verify if a given 
broadcast protocol is correct against the node adversary. 

9 Conclusion and Open Problems 

We showed that sometimes the work of broadcast protocols for asynchronous 
networks has to be exponential in the size of the underlying networks. A reason 
of this phenomenon is that the adversaries considered are very strong. It would 
be interesting to identify weaker but natural adversaries, for which polynomial- 
work protocols always exist. 

The problems considered in this paper concern centralized algorithms. Intro- 
ducing a viable model of asynchronous radio networks and developing distributed 
communication protocols for them is an interesting topic to pursue. 
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Abstract. We consider the problem of D-hop virtual path layout in 
ATM (Asynchronous Transfer Mode) networks. Given a physical net- 
work and an all-to-all traffic pattern, the problem consists of designing a 
virtual network with a given diameter D, which can be embedded in the 
physical one with a minimum congestion (the congestion is the maximum 
load of a physical link). Here we propose a method to solve this problem 
when the diameter is 2. We use this method to give an asymptotically 
optimal solution for the 2-hop virtual path layout problem for all-to-all 
traffic when the physical network is a mesh, a torus or a chordal ring. 



1 Introduction 

Asynchronous Transfer Mode (ATM) is the world’s most widely deployed back- 
bone technology. This standards-based transport medium is widely used in all 
kinds of telecommunications networks to send data, video and voice at ultra 
high speeds. An ATM network consists of nodes and physical links connect- 
ing the nodes of the network. There are two kinds of routes in ATM network, 
called virtual paths and virtual channels. A virtual channel is used to create a 
connection between two nodes that wish to communicate, while a virtual path 
serves an internal network role and is used to build several virtual channels that 
share part of their route. Each virtual path corresponds to a simple path in the 
network while a virtual channel is a concatenation of several virtual paths. A 
D-hop virtual channel is the concatenation of D virtual paths. Given a set R of 
pairs of nodes in a network, the D-hop virtual path layout (VPL) problem is the 
problem of determining a set of virtual paths and virtual channels so that any 
pair of nodes in R is assigned an Ghop virtual channel with £ < D. The cost of 
implementing a Z?-hop virtual path layout depends on the maximum load of any 
physical link, i.e., the number of paths that share any physical link. A layout 
that minimizes this maximum load will have the smallest cost. At the same time, 
the transmission cost on a virtual channel is proportional to the hop count D, 
and thus it is useful to consider virtual path layouts for small values of D. 

* The work was supported partially by NSERC, Canada and was done while the first 
author was visiting Concordia. 
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The VPL problem for a set of communicating nodes i? in a given graph G 
and fixed hop count D can be seen as finding a virtual graph with the vertex 
set R and with diameter D, and then finding an embedding of the virtual graph 
in G that minimizes the load. For instance, if we consider the all-to-all traffic 
pattern in a graph G and are interested in only one-hop paths, the only possible 
virtual graph is the complete graph. Thus the problem reduces to finding an 
embedding of the complete graph in G with minimum load. When the number 
of hops is more than one however, one has to choose both the virtual graph and 
the embedding. We formalize this model below. 

Let G = {V, E) and H = {V, E') be two undirected graphs with the same set 
of vertices V. An embedding P of H in G consists in associating with each edge e 
of E[ an elementary path P{e) in G. So P is a mapping of E' into the set of paths 
in G. The pair (P, P) is called a virtual path layout (VPL for short) on G. 
G is called the physical graph and H is called the virtual graph. The edges 
of G are called physical edges and the edges of H are called virtual edges. 
The load of a physical edge e of the graph G = (V,E) for the VPL (H,P), 
denoted by 7 t(G, H, P, e) is the number of paths of P{E') which use the physical 
edge e : 7 t(G, P, P, e) = | { e' € E' s.t. e € P(e') } |. The maximum load 
of the embedding is denoted by tt{G,H,P) = maxees { tt{G, H, P,e) }. A 
virtual graph H on G can be embedded in several ways; we denote by 7 t(G, P) 
the minimum of 7 t(G, H, P) over all embeddings P. As stated in the introduction, 
we are interested in virtual graphs El of diameter at most D. Our aim is to find 
a virtual graph El which has the minimum value 7t(G, P) among all the 
graphs with diameter at most D. We will denote this minimum value 
^(G,P). 

In [2], authors give a general lower bound for this problem: 

Theorem 1 ([2]). For any instance {G,D), where G is an n-vertex graph of 
bounded genus and bounded degree, 7t(G, D) = fl . 

As mentioned earlier, if P = 1 there is only one possible virtual graph, namely 
the complete graph and 7 t(G, 1) is related to the edge-forwarding index [7]. For 
P > 1, the problem has been solved exactly when the physical graph G is a chain 
[3] and for P = 2 when G is a cycle [4]. For D = 3i, where i is an integer, the 
authors of [2] give constructions for meshes which are asymptotically optimal. 
Results concerning other physical graphs and set of requests can be found in [5, 
6]. As far as we know, there are no previous asymptotically optimal results for 
tori and meshes for P = 2. 

In this paper, we give an asymptotically optimal solution to the problem 
of 2-hop virtual path layout for all-to-all traffic in meshes and tori. In order to 
achieve a 2-hop layout, our method is to designate some nodes in the graph to be 
switches, which are used to interconnect the remaining nodes. A similar approach 
was used in [4] . The novel idea we use here is that the connections between nodes 
are specified according to a suitable projective plane. This enables us to achieve 
a diameter of two, as well as to minimize the load on the links adjacent to 
switches. By a careful placement of the switches and assignment of paths to 
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switches, we are then able to achieve an asymptotically optimal load for meshes, 
tori, and chordal rings. The constant factor in the leading term for the load in 
our construction is much smaller than that in the result of [2]. 

The rest of the paper is organized as follows. The next section describes the 
construction of our family of virtual graphs and its properties. Section 3 gives 
our results on tori, meshes and chordal rings. We conclude with a discussion of 
open problems in Section 4. 



2 The Virtual Graph 

One way of constructing a virtual graph of diameter 2 is to designate a set of 
nodes to be so-called switches such that every node is connected to some subset 
of switches in such a way that for every pair of nodes, there is always one switch 
that they are both connected to. This can be achieved by a star topology, where 
all nodes are connected to the same central switch. However, the congestion at 
the central switch is obviously very high if its degree is bounded. To distribute 
the task of this central switch, the nodes are divided into subsets, which are 
interconnected by means of several switches. To specify which subsets of nodes 
are connected to which switches, we use the notion of projective planes. 



2.1 Projective Planes and Difference Sets 

Definition 1 (Projective Plane [8]). For q a power of a prime, a finite pro- 
jective plane of order q denoted PP{q), consists of a set X of q'^ q 1 elements 
called points, and a set L of (jqF Vj — element subsets of X called lines having 
the property that any two points lie on a unique line. 

The existence of a projective plane PP{q) is related to the existence of a field 
of order q, and we have: 

Lemma 1. [9] There is a finite projective plane PP{q) whenever q is a power 
of a prime number. 

We use the properties of projective planes given by the following lemma: 
Proposition 1. In a projective plane of order q, the following holds: 

— Every point lies on q 1 lines; 

— Two lines meet in a unique point; 

— There are q^ q-\- 1 lines. 

One way of constructing projective planes is using the notion of difference 
sets. Since this construction is helpful to us in calculating the load of the em- 
bedding of the virtual graph created by using the projective plane, we describe 
it below: 
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Definition 2 (Difference set [1]). A difference set modulo n is a set of q+1 
positive integers 

{do) di, c? 2 , da, dg}, where n> q+1, with the property that all differences 
di — dj modulo n for i j are different. The numbers n and q + 1 are called the 
modulus and size of the difference set respectively. 

The operation of adding 1 (modulo n) to each integer in a difference set is 
called a shift, and by performing a shift, one may obtain a new difference set. 
Thus, one can perform q^ + q successive shifts to obtain q'^ + q+l distinct differ- 
ence sets. In other words, if D is the difference set {do, . . . ,dq} , the difference set 
Di = {do + i(mod n), . . . ,dq + j(mod n)} for i € (0, . . . , n — 1}. For example, for 
q = 2,n = 7, the set {1,2,4} can be verified to be a difference set. By performing 
shifts modulo 7, we can obtain the difference sets {2, 3, 5}, {3, 4, 6}, {4, 5, 7}, 
{1,5,6},{2,6,7}, and {1,3,7}. 

Difference sets are closely linked to projective planes as shown by the follow- 
ing theorem: 

Theorem 2. Let D be a difference set of size (? -I- 1 where q > 2 is a power of a 
prime and modulus n = q^ + q + 1. Then D induces a projective plane PP{q) of 
order q which has the following properties: 

— the points of PP{q) are the integers {0, 1, 2, 3, ..., q^ + q\. 

— the lines of PP{q) are the difference sets obtained by performing n successive 

shifts on D. 

The projective plane given by the difference set {1,2,4} modulo 7 and its 
shifts is the so-called Fano plane and is shown in Figure 1. 




Fig. 1. The Fano plane 



2.2 Virtual Graph Construction 

We now describe the key idea of our construction of a family of virtual graphs. 

Definition 3 (Projective Graph). Let q be a power of a prime, and let the 
number of vertices n be fixed such that n > q^ + q + 1. Then a projective graph 
H{n,q) is constructed as follows: divide the vertex set V into q^ + q + 1 sets, 
Vo, Vi, . . . , Vq 2 _^_q, corresponding to the points of PP{q). Each set Vj has a special 
designated vertex called the switch, denoted Si. Lf the line i of PP{q) consists 
of the points i\,i 2 , . . . ,iq+i, then each vertex in the set Vj is connected to the 
switches , . . . , , barring any self-loops. 



Two-Hop Virtual Path Layout in Tori 



73 



Lemma 2. For q a power of a prime and n > + q + 1, a projective graph 

H{n,q) has diameter 2. 

Proof. Let x and y be two vertices in the graph H (n, q) . We show that in each 
of the cases below that dist{x,y) < 2. 



— x,y € Vi for some i: From the definition of a projective graph, it follows that 
X and y are connected to the same set of switches, which is a non-empty set, 
and so dist{x,y) < 2. 

— X G Vi, y € Vj where i yf j: Since any two points of PP{q) lie on a line, 
there is a switch k such that either x = Sk or y = Sk and they are directly 
connected, or x and y are both connected to Sk, in which case dist{x, y) = 2. 



Lemma 3. Let G he an n-vertex graph of maximum degree A and H(ji,q) 
projective graph, 






n{q + 1) 
A{q^ + 9 + 1) 



9+1 

Z\ 



a 



Proof. Let {Vi} be the partition of the vertices such that each Vj corresponds to 
a line of PP{q) of a projective graph H{n, q). As each vertex of Vi is connected 
to 9 + 1 switches, the sum of the degree in FI of the switches is at least (9 + 1) x 
X)o<i<g2+g |l^l- So one of the switches has a degree in F[ of at least — 

(9^ + 9 + 1)) and the virtual edges connected to this switch can be balanced on 
at most A physical edges. 



Corollary 1. Let G he an n-vertex planar graph of hounded genus and hounded 
degree. Lf H{n,q) is a projective graph and tt{G,H) = 0 then 9 = 

0 (^). 

Proof. Theorem 1 shows that, if G is of bounded genus and bounded degree, 
7 t(G, 2) = L? . Lemma 3 indicates that, to achieve this bound, 9 must be 



3 Embedding the Virtual Graph 

In this section, we describe some ways in which a virtual projective graph as 
described above can be embedded in a torus, mesh, and some chordal rings. We 
give bounds on the load of these embeddings. 

3.1 Torus and Mesh 

We denote by Ti^xe a torus with L rows and £ columns, vertex Xij denotes the 
vertex of the torus in row i and column j, where 0 < i < L,0 < j < £. The 
following lemma shows that one can define a projective graph as described in 
the last section that can be embedded in some square tori such that the load is 
asymptotically optimal. 
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Lemma 4. Let q be a power of a prime and L = £ = q'^ + q+1. Then there exists 
an embedding of a projective graph in Tj^xi such that the load on any horizontal 

edge of the torus is at most + o ( \/n?) and the load on any vertical edge of 



r 4/ — 3 

the torus is at most — h o 

torus. 




where n is the number of vertices in the 



Proof. Consider the torus ^ such that q^ + q + 1 = ^/n with q a power 
of a prime. We specify below a projective graph and its embedding (or rout- 
ing) in with an asymptotically optimal load. Using the difference set 

{do, di, . . . , dq} (with do < d\ < ... < dq), construct a virtual projective graph 
H{n,q) as follows: 

— Partition the set of vertices V into the subsets Vi {Q <i < q^ -\- q) such that 

Vi is the set of vertices in the z-th row, that is, Vi = {xij : 0 < j < -I- g}. 

— Designate the middle vertex of row z, that is, Xi^(_q 2 _^q )/2 to be the switch Si 
of the vertex set Vi . 

— The edges of H are, for v G Vi, [u, ^od q'^+q+i] for 0 < z < g^ -I- g and 
d G {do, . . . , dg} (barring self-loops). 

We will associate a path in ^/n with each edge of H such that the maximum 
load is 0 . Let i,j and k be integers such that Sk is connected to Xij in H. 

The associated path in T/^^x V" follow the shortest path along the horizontal 
edges from Sk to Xkj followed by the shortest path along the vertical edges from 
Xk,j ^i,j ' 

Figure 2 shows the groups, switches and paths connecting group Vq to switches 
So, Si and S 3 with the construction using a projective plane of order 2 with the 
difference set {0,1,3}. 




Fig. 2. Virtual edges between Vo and switches so, si and S3 and their routing in Ttxt 

Using this routing, the load on the horizontal edges of T/^jx^n type 
(xij,Xij±i) is only induced by edges connecting Si to non-switch vertices and 
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its maximum is reached on the edges (a:j_((g 2 _|_g)/ 2 )-ij Si) and (si, Xi_((q 2 _|_^)/ 2 )-i-i)- 
Since Sj is connected to (g -I- 1) x (g^ -|- g) non-switch vertices and the load is 
balanced on these two edges, the maximum load on any horizontal edge (called 

maximum horizontal load) is equal to -g q 

The maximum load on a vertical edge of (called maximum vertical 

load) is determined by the chosen difference set {do,di, . . . ,dg}. Each vertex of 
the torus is the endpoint of g -I- 1 paths between the vertex and switches and 
these paths start with vertical parts of length di if di < (g^ -I- g) /2 or of length 
di — g^ -I- g if di > (g^ -I- g) /2, since the vertical part does not need to be more 
than the half of the number of rows. Since the vertical parts are the same for 
each vertex, it is easy to observe that the load on any vertical edge of the torus 
is the same and is the sum of the lengths of the vertical parts which is equal to 
J2i=o niiii{di, |g^ -I- g — di|}. To obtain an upper bound on the value of the sum, 
observe first that any element of the sum is at most equal to (g^ -I- g) /2 and there 
can be at most two such values. Secondly, since {do, di, . . . , dq} is a difference set, 
the differences between any two consecutive elements in the difference set must 
be distinct. Using these two facts, we can obtain the required bound. When g is 
even, the required sum would be at most obtained by taking dq /2 = (g^ + g)/2, 
dq/ 2 +i = (d^ + d)/2 + 1 and dqj^-i = dq/2-i-i-i — 2i, 1 < z < g/2, and dg/2-i-i-i-i = 
dq/ 2 -i-i-i-i-i +2z + 1, for 1 < z < g/2 — 1. Similarly when g is odd, the upper bound 
can be obtained by taking d(g_i )/2 = (g^-l-g)/2, d(^q_i)/ 2 +i = ('Z^ + 9)/2+l and 
d{q-l)/2-i = d(q-iy2-i+l ~ 2z, and d(g_i)/2-|-l-|-i = C^( 9 -l)/ 2 -|-l-|-i-l + 2z -|- 1, for 

!<*<(?- l)/2. Thus X)Lo + d - dj < + d)/2 - z(z + 1) + 

E!=o”'((d" + d)/2 - i(i + 2)) = 5/12g3 + 5/8q^ + 7/12g <^ + o (^) . 

In Figure 2, the maximum vertical load given by the embedding is 3 and the 
maximum horizontal load is 9. We remark here that the multiplicative constant 
in the above bound can be improved by obtaining a better general bound on 
the vertical load which is determined by the difference sets. A similar projective 
graph construction can be embedded into more general cases of tori with optimal 
load as shown by the following theorem: 

Theorem 3. Let q he a power of a prime and L and I be two integers such that 
L = ci{q^ + q + \) and ^ = C 2 (d^ + d + 1) for some integer constants c\ and C 2 - 
Then 

7r(Tix^,2) = 0(^) 

where n is the number of vertices in the torus, which is asymptotically optimal. 

Proof. Using a construction similar to the one from Lemma 4, we divide the 
torus into consecutive bands of Ci rows with one switch in a central position 
in each band. The vertical load increases by a factor proportional to ci and 
the horizontal load increases by the factor ciC 2 . Thus the maximum load is 
proportional to g^. Clearly, n = 0{q^) and the lower bound given by Theorem 1 
is C(g^). 
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We denote by M^xt the mesh with L rows and £ columns. It is easy to prove 
that IT D) < T^{MLxe,D) < 27r(r£,xtj -D). It is worth pointing out however, 
that in the embedding described in Lemma 4, no horizontal wraparound edges 
are used, and hence the maximum horizontal load for the mesh is identical to 
that in the torus, while the vertical load in the case of the mesh can be at most 
double that in the torus. 

Theorem 4. Let q he a power of a prime and and L = £ = q'^ + q + 1. Then 
there exists an embedding of a projective graph in T^xt such that 

7t(Mlx^, 2) < + o 

where n is the number of vertices in the mesh, which is asymptotically optimal. 



Improving the Result for Torus. As we can see from Figure 2, the horizontal 
edges are not evenly loaded, and the load on some horizontal edges is greater than 
the load on any vertical edge. The lower bound of Lemma 3 is and to achieve 
this bound, the load on all edges incident on each switch must be evenly balanced. 
This can be done with a better layout of the switches in the torus. In the torus 
with q^ + q+l rows and columns we distribute the switches as follows: Place the 
first switch in the last column. If the switch in row i is in located in column j, 
place the switch in row i + 1 in the column {j + {q'^ + q)/2) mod (g^ + g+ l). This 
layout of switches is shown for a 7 x 7 torus in Figure 3. With this layout, if we 
use the same routing as in Lemma 4, the horizontal edges just above and below 
a switch have very low load. Thus, we can re-route some of the paths coming 
to the switch from the left and right by using the lightly loaded part of the 
row above and below the switch and the vertical edges incident on the switch. 
Using this technique we can improve the bound on the load by a constant factor. 
For example, for the 7x7 torus, we can achieve a maximum load of 6 which is 
optimal over all virtual path layouts defined by projective graphs. In general we 
obtain the following result: 

Theorem 5. Let q be a power of a prime and L = £ = q'^ + q+ l. Then there 
exists an embedding of a projective graph in T^xi such that 



t^{Tlx£, 2 ) < 




where n is the number of vertices in the torus, which is asymptotically optimal. 



3.2 Chordal Ring 

The technique applied to meshes and tori in the previous section is applicable 
to many cases of the chordal rings. We can divide the chordal ring of length 
(<7^ + 9 + 1)^ into segments of length q“^ + q + 1. The ring edges can then be 
treated as horizontal edges in the torus, and the chord edges can play the role of 
the vertical edges if the chord length is not “too short” or “too long” . By using 
a similar embedding as described in Section 3.1, we obtain the following result: 
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Fig. 3. Layout of switches that balances the load 



Theorem 6. Let q be a power of a prime, n = {q^ + q + 1)^ and G he a chordal 
ring on n vertices with chord length k such that q^ + q + 1 < ak < 2(q^ 1) 

for some integer a. Then 7t(G, 2) = 

The same bound can be obtained if n = 0{q‘^). 

4 Discussion 

We give an asymptotically optimal solution to the virtual path layout problem 
for all-to-all traffic in meshes, tori and chordal rings with a maximum hop count 
of 2. Although we stated the results for the case when the number of rows and 
columns is proportional to q^ + q + 1 for a prime power q, we can easily extend 
our results to other sizes. For example, a grid having 10 rows can be partitioned 
into 7 segments such that 6 segments consist of one and half rows, one segment 
contains one row, and we proceed with q = 2. 

Our solution uses a construction based on a projective plane structure. It 
would be interesting to see whether this kind of finite geometry construction 
gives optimal or near-optimal solutions for other families of graphs. It would 
be also interesting to see whether some other finite geometries yield an optimal 
solution when the number of hops is greater than two. 



References 

1. L.D. Baumert. Cyclic Difference Sets. Lecture Notes in Mathematics 182, Springer- 
Verlag, 1971. 

2. L. Becchetti, P. Bertolazzi, C. Gaibisso, and G. Gambosi. On the design of efficient 
ATM routing schemes. Theoretical Computer Science, Volume 270, Issues 1-2:341- 
359, 2002. 

3. S. Choplin. Virtual Path Layout in ATM path with given hop count. In International 
Conference on Networking, ICNOl, volume 2094, Part II of LNCS, pages 527-537. 
Springer, 2001. 



78 



Sebastien Choplin, Lata Narayanan, and Jaroslav Opatrny 



4. S. Choplin, A. Jarry, and S. Perennes. Virtual network embedded in the cycle. 
Discrete Applied Mathematics, to appear. 

5. I. Cidon, O. Gerstel, and S. Zaks. A scalable approach to routing in ATM networks. 
In Proc. of the 8th International Workshop on Distributed Algorithms, WDAG ’94, 
volume 857 of Lecture Notes in Computer Science, pages 209-222. Springer Verlag, 
1994. 

6. O. Gerstel and S. Zaks. The Virtual Path Layout problem in fast networks (extended 
abstract). In Proceedings of the Thirteenth Annual ACM Symposium on Principles 
of Distributed Computing, pages 235-243, Los Angeles, California, 14-17 August 
1994. 

7. M-C. Heydemann, J-C. Meyer, and D. Sotteau. On forwarding indices of networks. 
Discrete Applied Mathematics, 23:103-123, 1989. 

8. S. Jukna. Extremal Combinatorics with applications in computer science. Springer- 
Verlag, 2001. 

9. W. D. Wallis. Combinatorial Designs. Marcel Dekker, New York, 1988. 




Robot Convergence 
via Center-of- Gravity Algorithms 



Reuven Cohen and David Peleg 

Department of Computer Science and Applied Mathematics, 
The Weizmann Institute of Science, Rehovot 76100, Israel 
{r . cohen,david.peleg}@weizmaim. ac . il 



Abstract. Consider a group of N robots aiming to converge towards 
a single point. The robots cannot communicate, and their only input is 
obtained by visual sensors. A natural algorithm for the problem is based 
on requiring each robot to move towards the robots’ center of gravity. 
The paper proves the correctness of the center-of-gravity algorithm in the 
semi-synchronous model for any number of robots, and its correctness in 
the fully asynchronous model for two robots. 



1 Introduction 

1.1 Background and Motivation 

In hazardous or hostile environments, it may be desirable to employ large groups 
of low cost robots for cooperatively performing various tasks. This approach has 
the advantage of being more resilient to malfunction and more configurable than 
a single high cost robot. Consequently, autonomous mobile robot systems have 
been studied in different contexts, from engineering to artificial intelligence (e.g., 
[1-11]). A survey on the area is presented in [12, 13]. 

During the last decade, various issues related to the coordination of multiple 
robot systems have been studied from the point of view of distributed comput- 
ing (cf. [14-20]). The focus is on trying to model an environment consisting of 
mobile autonomous robots, and studying the capabilities the robots must have 
in order to achieve their common goals. A number of computational models were 
proposed in the literature for such systems. In this paper we follow the models 
of [19-21]. The robots are identical and indistinguishable, cannot communicate 
between them, and do not operate continuously. They wake up at unspecified 
times, observe their environment using sensors which are capable of identifying 
the locations of the other robots, perform a local computation determining their 
next move and move accordingly. 

To model the behavior of the robots, several activation scheduling models 
have been suggested. In the synchronous model, all robots are active at every 
cycle. In the semi-synchronous model some robots are active at each cycle. In 
the asynchronous model, no cycles exist and no limit is placed on the latency 
of each robot movement. The models are presented more formally in the next 
section. 
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Two basic coordination tasks in autonomous mobile robot systems that have 
received considerable attention are gathering and convergence. The gathering 
task requires the robots to occupy a single point within a finite number of steps, 
starting from any initial configuration. The closely related convergence task re- 
quires the robots to converge to a single point, rather than reach it (namely, for 
every e > 0 there must be a time t^ by which all robots are within distance of 
at most e of each other). 

A straightforward approach to these problems is based on requiring the robots 
to calculate some median position of the group and move towards that position. 
The current paper focuses on what is arguably the most natural variant of this 
approach, namely, using the center of gravity (also known as the center of mass, 
the barycenter or the average position) of the robot group. 

The center of gravity approach is easy to analyze in the fully synchronous 
model. In the semi-synchronous or asynchronous models, however, analyzing the 
process becomes more involved, since the robots may take their measurements 
at different times, including while other robots are in movement. This might 
result in oscillatory effects on the centers of gravity calculated by the various 
robots, and possibly cause them to pass each other by in certain configurations. 
Subsequently, the correctness of the center of gravity algorithm has not been 
proven formally so far (to the best of our knowledge), i.e., it has not been shown 
that this algorithm guarantees convergence. 

Several different and more sophisticated algorithms have been proposed be- 
fore, some of which also guarantee gathering within finite time. The gathering 
problem was first discussed in [19, 20] in the semi-synchronous model. It was 
proven that it is impossible to achieve gathering of two oblivious autonomous mo- 
bile robots that have no common sense of orientation under the semi-synchronous 
model. (Convergence is easy to achieve in this setting.) Also, an algorithm was 
presented in [20] for gathering N > 3 robots in the semi-synchronous model. In 
the asynchronous model, an algorithm for gathering A^ = 3,4 robots is brought 
in [22, 16], and an algorithm for gathering N > 5 robots has recently been de- 
scribed in [15]. The gathering problem was also studied in a system where the 
robots have limited visibility [23,24]. 

Nevertheless, the center of gravity approach has several important advan- 
tages, making it desirable in many cases. 

— It requires a very simple and efficient calculation, which can be carried out 
on simple hardware, and requires very low computational effort. 

— It can be easily applied to 1, 2 or 3 dimensions and to any number of robots. 

— It has a bounded and simple to calculate error due to rounding. 

— It is oblivious, i.e., it requires no memory of previous actions and positions, 
rendering it both memory-efficient and self-stabilizing (in the sense that a 
finite number of transient errors cannot prevent eventual convergence). 

— It prevents deadlocks, that is, every robot can move at any given position 
(unless it is already at the center of gravity) . 

In the current paper, we address the convergence of the center of gravity 
algorithm. In Section 2 we prove the correctness of the algorithm in the semi- 
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synchronous model of [19]. The problem appears to be more difficult in the 
asynchronous model. In Section 3 we make a modest first step by providing a 
convergence proof in this model for N = 2 robots. 

Recently, we have been able to extend the result presented in the current 
paper and show that the center of gravity algorithm converges in the fully asyn- 
chronous model for any number of robots [25] . 



1.2 The Model 

The basic model studied in [14-20] can be summarized as follows (with some 
minor changes) . The robots are expected to execute a given algorithm in order to 
achieve a prespecified mission. Each of the N robots Ri in the system is assumed 
to operate individually, repeatedly going through simple cycles consisting of four 
steps: 

— Look: Identify the locations of all robots in Ri’s private coordinate system; 
the result of this step is a multiset of points P = {pi , . . . ,pn} defining the 
current configuration. The robots are indistinguishable, so each robot Ri 
knows its own location pi, but does not know the identity of the robots at 
each of the other points. 

— Compute: Execute the given algorithm, resulting in a goal point pa- 

— Move: Move on a straight line towards the point pa- The robot might stop 
before reaching its goal point pc, but is promised to traverse a distance of 
at least S (unless it has reached the goal). 

— Wait: The robot sleeps for an indefinite time and the awakens for the next 
Move. 

The “look” and “move” operations are identical in every cycle, and the dif- 
ferences between various algorithms are in the “compute” step. The procedure 
carried out in the “compute” step is identical for all robots. Notics that for the 
sake of analysis, the Compute and Wait steps can be absorbed into the Move 
step since no restriction is placed on the relative rate of movement during this 
step. 

In most papers in this area (cf. [26,19,23,16]), the robots are assumed to 
be rather limited. To begin with, the robots are assumed to have no means of 
directly communicating with each other. Moreover, the robots are also assumed 
to be oblivious (or memoryless), namely, they cannot remember their previous 
states, their previous actions or the previous positions of the other robots. Hence 
the algorithm employed by the robots for the “compute” step cannot rely on in- 
formation from previous cycles, and its only input is the current configuration. 
While this is admittedly an over-restrictive and unrealistic assumption, devel- 
oping algorithms for the oblivious model still makes sense in various settings, 
for two reasons. First, solutions that rely on non-obliviousness do not necessar- 
ily work in a dynamic environment where the robots are activated in different 
cycles, or robots might be added to or removed from the system dynamically. 
Secondly, any algorithm that works correctly for oblivious robots is inherently 
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self-stabilizing, i.e., it withstands transient errors that alter the robots’ local 
states. 

We consider two timing models. The first is the well studied semi-synchronous 
model (cf. [19]). This model is partially synchronous, in the sense that all robots 
operate according to the same fixed length clock cycles. However, not all robots 
are necessarily active in all cycles (the model is based on the assumption that 
each cycle is instantaneous). Rather, at each cycle t, any non-predetermined 
subgroup of the robots may commence the Look-Compute-Move cycle. The 
activation of the different robots can be thought of as managed by a hypothetical 
scheduler, whose only fairness obligation is that each robot must be activated 
and given a chance to operate infinitely often in any infinite execution. 

The second model is the fully asynchronous model (cf. [15, 16]). In this model, 
robots operate on their own (time-varying) rates, and no assumptions are made 
regarding the relative speeds of different robots. In particular, robots may remain 
inactive for arbitrarily long periods between consecutive operation cycles. 

To describe the center of gravity algorithm, hereafter named Alg. Go_to_C0G, 
we use the following notation. Denote by fi[t] the location of robot i at time t. 
Denote the true center of gravity at time t by c[t] = ^ dW- Denote by 
Ci\t] the center of gravity as last calculated by the robot i before time t, i.e., 
if the last calculation was done at time t' < t then Ci[t] = c[t']. Note that, as 
mentioned before, robot i calculates this location in its own private coordinate 
system; however, for the purpose of describing the algorithm and its analysis, it 
is convenient to represent these locations in a unified global coordinate system 
(which of course is unknown to the robots themselves). By convention Ci[0] = 
Ci[0] for all i. 

Algorithm Go_to_C0G is very simple. After measuring the current configu- 
ration at some time t, the robot i computes the average location of all robot 
positions (including its own), Ci[f\ = then proceeds to move to- 

wards the calculated point Ci [t] . (As mentioned earlier, the move may terminate 
before the robot actually reaches the point Ci[t], but in case the robot has not 
reached Ci[t], it must has traversed a distance of at least S. Also, in the semi- 
synchronous model, the move operation terminates by the end of the cycle.) 

2 Convergence in the Semi-synchronons Model 

In this section we prove that in the semi-synchronous model. Algorithm Go_to_ 
COG converges for any number of robots N > 2. 

We start with a technical Lemma. The robots’ moment of inertia at time t 
is defined as 

1 ^ 

-c[t])^ . 

i=l 

Lemma 1. Suppose that at some given time, all robots reside in a radius R 
circle centered at the origin, and let Q < xq < R. If the x coordinate of each 
robot satisfies x > Xq, then their moment of inertia satisfies I < 4(i?^ — Xq). 
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Fig. 1. Illustration of Lemma 1. 



Proof. Since the region A obtained from the intersection of the circle and the 
half-plane a; > Sq is convex, the center of gravity c must also lie inside A. Since A 
contains less than half the circle, the longest chord in A is the segment between 
the points a = (xg, — x'q) and b — {xg, — x'^) (see Figure 1). The 

length of this segment is L = 2^J — x^ and therefore the distance between 
each robot and the center of gravity cannot exceed L. Thus, 

' = < L», 

1=1 



□ 

Defining the function /j[f] = ^ W ~ notice the following (cf. [27]). 

Lemma 2. R > /g for every x. 

Lemma 3. For the semi- synchronous model, in any execution of Algorithm 
Go_to_COG, I[i\ is a non-increasing function of time. 

Proof. In each cycle t, each robot that is activated, and is not already at the 
current center of gravity, can get closer to the observed center of gravity c[t\, 
hence Ic[t][t] > Ic[t][t + Ij- By Fact 2, Is[t][t -b 1] > Fe[t-K][f + Ij- Combined, we 
have 

I[t] = ^ ^e[t-i-l] [f + 1] = ^[^+1] • 

□ 



Recall that S is the minimum movement distance of the robots. 

Lemma 4. If at some time tg the robots’ moment of inertia is Ig = I[tg\, then 
there exists some i > tg, such that I\ = /[f] < max{/ — (l — I}- 

Proof. Assume without loss of generality that c[to] is at the origin. Take i to 
be the robot most distant from the origin at time tg, and assume again that it 
lies on the positive x axis, at point (i?, 0). Since it was the most distant robot, 
we have P? /N < Ig < Pf. Now take ti to be the time of this robot’s first 
activation after tg. By Lemma 3, I[ti] < Ig. We separate the analysis into two 
cases. First, suppose that the distance of robot i from c[fi], the center of gravity 
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at time t\, is at least Then, in its turn, it can move a distance at least 

m = min{jp^,S'}. This decreases I by at least ^ as required. Thus taking 
t = ti + 1, the lemma follows. 

If the robot cannot move a distance m in its turn, this implies that the 
distance of this robot from the center of gravity at time t is less than m. Since the 
circle is convex, no robot can reside at a distance greater than R from the origin. 
The difference between the x coordinate of robot i and c[t\\, the center of gravity 
at time t\, cannot exceed m < Since the center of gravity is the average of 

the robots’ locations, the x coordinate of each robot j must satisfy Xj > R—Nm. 
By Lemma 1, I[t\] < 4(i?^ — {R — NmY) < 8RNm < < (l — Toip^) ^[^o]- 

Thus taking i =t\, the lemma follows. □ 

Theorem 1. In the semi-synchronous model, for N >2 robots in 2-dimensional 
space, Algorithm Go_to_COG converges. 

3 Asynchronous Convergence of Two Robots 

Let us now turn to the fully asynchronous model. We prove that in this model. 
Algorithm Go_to_C0G converges for two robots. Our first observation applies to 
any N >2. 

Lemma 5. If for some time to, ri[to] and Ci[to] for all i reside in a closed convex 
curve, V , then for every time t > to, ri[t] and Ci[t] also reside in V for every 
l<i<N. 

Proof. For the Move operation, it is clear that if for some i, fi[to] and Ci[to] both 
reside in a convex hull then for the rest of the move operation q [t] = Ci [to] does 
not change and fjt] is on the segment [ri[to]>Ci[to]]i which is inside V. 

For the Look step, if = 2 then the calculated center of gravity is on the line 
segment connecting both robots, and therefore respects convexity. For N > 2 
robots the center of gravity is on the line connecting the center of gravity of 
N — 1 robots and the Nth robot, and the Lemma follows by induction. □ 

For the following we assume the robots reside on the a;-axis. For every t, let 
H[t] denote the convex hull of the points fi[t], f^lt], c\[t] and C 2 [t], namely, the 
smallest closed interval containing all four points. 

Corollary 1. For N = 2 robots, for any times t,to, ift > to then H[t] C H[to], 
namely, the points fi[t], f 2 [t], ci[t] and C 2 [t] reside in the line segment H[to]. 

Lemma 6. If for some time to, r 2 [to] > t^i[to]; C 2 [to] > i^i[to] and fi[f\ is a 
monotonic non-increasing function for t G [to>^; lb.en 

1. at all times t G [to 7 f]> A[t] > Ii[t] and C 2 [t] > r\\t], and 

2. if at time t* G [tojf] robot 2 performed a Look step, then C 2 \t] G [ri[t], f 2 [tj] 
at all times t G [t*,i\. 
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Proof. Suppose that during the time interval [to j ^ j robot 2 performed k > 0 
Look steps. Set = t, and if fc > 1 then denote the times of these Look steps 
by ti, ^ 2 ) • ■ ■ ) tfc G [toj tfe+i]- We now prove by induction that claims 1 and 2 hold 
also for all times t G [ti, ti+i] for 0 < i < k. 

For i = 0, observe that since robot 2 did not perform a Look step throughout 
the entire time interval [tojti], its location r 2 [t] at any time t G [toi^i] is in the 
interval [^ 2 (^ 0 ], C 2 [to]], which is entirely to the right of ri[to] > hence claim 
1 of the lemma follows. Claim 2 of the lemma holds vacuously, and we are done. 

We now assume that claim 1 holds for the interval and prove that 

both claims hold at any time in the interval [ti,ti+i]. Indeed, notice that by 
assumption robot 2 is to the right of robot 1 at time U , and since a Look step is 
performed at that time, C 2 [ti] is at the average position of the robots, hence it is 
also to the right of robot 1. Thus, C 2 [t] = C 2 [ti] > f\ [ti] > f\ [t] for all t G [ti, ti+i] 
and claim 2 follows for that interval. During the time interval [ti,ti+i], robot 2 
traverses the segment [c 2 [ti], f 2 [ti]], which is entirely to the right of ri[ti] > fi[t], 
hence claim 1 follows as well. □ 

Lemma 7. For N = 2 robots, for every time tg there exists a time t > tg in 
which Ci[t\ = fi[t] and Cj[i\ resides on the line segment [fi,f 2 ] for i ^ j and 
i,jG{l,2}. 

Proof. Let [a, 6] = H[to\. Take ti > tg to be the first time at which robot 
1 completes its next Move operation. Thus, fi[ti] = ci[ti]. Without loss of 
generality assume r 2 [ti] > ri[ti], i.e. f 2 [ti] falls in the segment [fi[ti],5]. There 
are now three possible configurations (see Fig. 2). 




Fig. 2. The three cases in the proof of Lemma 7: ordering of the robots. 



1. C 2 resides on the segment [fi,f 2 ]. Then take i = t\ and we are done. 

2. C 2 resides on the segment [r 2 ,b]. Then let t be the time when robot 2 com- 
pletes its current Move operation. Since robot 2 moves away from robot 1, 
Cl will always reside on the segment [fi, f 2 ] (by Lemma 6). At i, f 2 = C 2 and 
Cl resides on the segment [ri,f 2 ]. Thus, we are done. 
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3 . C2 resides on the line segment In this case take t2 to be the time at 

which robot 2 ends its Move operation. Whence, C2[t2] = r2[^2]- Since for 
any t € [^1,^2], r2[t] is on the right of C2[t] and ci[t] = (fi[t3] + f2[i3])/2 for 
some ta G it follows that c\\t] is to the right of C2[t] and therefore also 

fi[t] stays to the right of C2[t] = C2[ti]. Hence, at time t2, M and Ci are on 
the left of T2 = C2- Therefore, we reach either case 1 or 2 with the roles of 1 
and 2 reversed. This completes the proof. □ 

Lemma 8. For N = 2 robots, for every time to there exists a time t > to such 
that \H[i\\ < |H[to]|/2. 

Proof. Let L = |iL[to]|- Take ti > to to be the time when both robots have 
completed at least one cycle after to- By Corollary 1 they are still in H[to]. By 
Lemma 7 there exists t2 > ti such that at time t2, Ci[t2] = ri[t2] and C2[t2] G 
[^1,^2] (where robots 1 and 2 and the left /right directions are chosen to make 
this true). By Corollary 1 , [ri[t2],r2[t2]] C i 7 [to]. Take D = d(ri[t2],f2[t2]) < L. 

Now take to to be the time of the next Look step of robot 1 . Clearly Lx = 
c^(ci[t3],ri[t3]) < L/2 (since the center of gravity is the average of the robots’ 
locations and d(ri [ts], r2[t3]) < \H{to)\ < \H{to)\ = L). Now take t to be the 
time of the end of robot I’s Move operation, so ci[t] = fi[t]. Without loss of 
generality assume at to robot 1 is the rightmost. Either of the following three 
cases is possible at i (see Fig. 3 ). 




2 





b 



3 



a 




b 



Fig. 3. The three cases in the ordering of the robots in Lemma 8. 



1 . r2[t] is to the right of ri[t\. This means that for some time G 
ri\tA\ = 72(^4]. At any t G [^2,^4], hx[t] was to the right of V2\t], implying that 
C2[t] was also on the right of f2[t], and specifically C2[f] > ?’2[i] > ri[i\ = ci[i\. 
Therefore they are all on the segment [ 7 i[f],ri[t 3 ]] of length Lx < L/ 2 . 

2 . f2[t\ and C2[f| are to the left of rx[i\. Since robot 2 must have been moving 
right at all t2 < t < t it must hold that C2 [i\ resides on the segment [r2 [t] , 
and that d{rx[i],r2[i]) < d{rx[t2],r2[t2]) = D- Now, since d(ri[t2], Ci[t2]) = 
d{r\\t2],f2[t2])/‘^ = -D/ 2 , and r2[t] and C2[t] are to the left of fi[t] they are 
restricted to the segment left of ri[t] and to the right of r2[t2] of length 
D /2 < L/ 2 . 

3 . fx[i\ G [ 72 [f] , C2[t]]. Since d{r2[i\, C2[i\) < L /2 we are done. 



□ 
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Theorem 2. In the fully asynchronous model, for two robots in d dimensional 
space, Algorithm Go_to_COG converges. 

Proof. Apply Lemma 8 to each dimension separately. □ 
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Abstract. We propose a family of novel schemes based on Chord retain- 
ing all positive aspects that made Chord a popular topology for routing 
in P2P networks. The schemes, based on the Fibonacci number system, 
allow to improve on the maximum/ average number of hops for lookups 
and the routing table size per node. 



1 Introduction 

In this paper, we propose a family of new routing schemes that reduce the routing 
table size, and the maximum/average number of hops for lookup requests in 
Ghord-like systems [15] without introducing any other protocol overhead. The 
improvement is obtained with no harm to the simplicity and ease of programming 
that are some of the many good characteristics that made Ghord a popular 
choice. 

The basis of Ghord can be seen as a ring of N identifiers labelled from 0 
to V — 1. The edges, representing the overlay network, go from identifier x to 
identifier^ x + 2*, for each x G (0 , . . . ,N — 1} and i < log V. The degree and 
the diameter are log V, the average path length is (log V)/2. Routing is greedy, 
never overshooting the destination. 

Because of low diameter and average path length, Ghord offers fast lookup 
algorithms. By having also low degree, it provides efficient join/leave of nodes 
since the cost depends on the diameter and the degree and is, in fact, upper 
bounded by their product. Ghord is scalable: with n < N nodes present in the 
network the same performance (in terms of n rather than N) can be obtained 
w.h.p.. Efficient routing in Ghord is easy (a greedy algorithm is optimal) due 
to the fact that Ghord is uniform: x is connected to y iff a; -I- 2 is connected to 
y + z. Since we are restricting ourselves to uniform routing schemes, we can use 

* Work partially supported by EU RTN project ARACNE and by Italian EIRE project 
WebMinds. 

^ Throughout the paper, arithmetics on node identities is always mod N where N is 
the number of identifiers. Similarly, all the logarithms are base 2, unless differently 
specified. 
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the term jump of size s to indicate the existence in the overlay network of an 
edge from a: to a: + s for any identifier a; = 0, . . . , iV — 1 (e.g. Chord has jumps 
2*, for i < logN). 

Uniformity is a crucial requirement, since it makes any system a good can- 
didate for real implementations: besides simplicity in the implementation it also 
offers an optimal greedy routing algorithm without node congestion [16]. On the 
other hand, it is known that, as long as uniformity is required, the 0(log N) val- 
ues for the number of jumps and diameter cannot be asymptotically improved. 
Chord’s values of the three main parameters (degree vs. diameter and average 
path length) can be improved if one removes the uniformity request [4, 7-10, 12]. 

Because of its practicality and its known bounds, it assumes a certain rele- 
vance to improve the performance of Chord while retaining simplicity and seal- 
ability. Moreover, due to the above, it is interesting and practically useful to 
improve known results for uniform systems, even only by constant factors. Our 
objective is to show improved bounds on all the important parameters of a P2P 
system that affect lookup time and join/leave cost, i.e., degree, diameter and 
average path length. To this aim we will propose and analyze a novel family of 
uniform routing schemes. 

The lookup process in Chord can be seen as a binary search on an interval of 
N elements. A natural question to ask is whether the lookup can be realized with 
a more efficient search technique that can be translated into a uniform overlay 
network. Efficiency is measured in terms of degree and maximum/average path 
length. We notice that the problem poses several restrictions on the search model 
since we are assuming that all nodes are alike and, therefore, only queries taken 
from a globally given set can be used at any step. 

In this paper, we consider search techniques in the above model imposed 
by uniform routing algorithms (that is, when a fixed set of jumps is available) . 
To the best of our knowledge, while the problem is related to several search 
problems investigated in the literature, no useful results on problems totally 
fitting the above model and goals are known (see [3] and references therein 
quoted). A previous work in the P2P context in this direction is contained in 
[16]. Our starting point is Fibonacci search [2]. 



1.1 Our Results 

Let Fib(i) be the z-th Fibonacci number. We prove that any uniform algorithm 
that uses up to 6 jumps and has diameter d can reach at most N{S, d) consecutive 
identifiers where N (S, d) < Fib{6 -I- d -I- 1). This gives us a tradeoff of 1.441og 
on the sum of the degree and the diameter in any P2P network using uniform 
routing on N identifiers. We then show a family of routing scheme F-Chord(o;) 
that, besides improving all Chord parameters, also reaches equality for any choice 
of the parameters in the inequality above with [d — d| < 1. 

Our analysis has been carried out by considering the A^-size identifier space. 
We use a standard technique (see [15] for technical details) to manage the situa- 
tion when n < N nodes are in the network. In this case, in fact, we assume that 
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the n nodes are uniformly distributed at random on the N identifiers, therefore 
we can consider a ring of n “chunks” each with N /n identifiers. 

Since in each chunk there are at most 0(log n) nodes w.h.p., any deterministic 
result on the diameter in terms of N can be easily translated in the same result 
in terms of n, with high probability. The same argument can be used for the 
degree. In fact, the distance between two nodes is at least N /n^ w.h.p.. If 5{N) is 
the degree when all the N identifiers are used, then the number of finger pointers 
that are useful (and consequently stored) with n nodes is at most 26{N), w.h.p.. 

1.2 Related Work 

The Chord system was introduced in [15] to allow efficient lookup in a Dis- 
tributed Hash Table (DHT). By using logarithmic size routing tables in each 
node. Chord allows to find in logarithmic n number of routing hops the node of 
a P2P system that is responsible for a given key. Adding or removing a node 
is accomplished at a cost of 0(log^ N) messages. A thorough study of uniform 
system is given in [16]. 

In general. Chord can be improved at the expenses of uniformity [7,9,10]. 
Recently, some non-greedy routing algorithms were proposed, that use De-Bruijn 
based DHT [4, 11], whose goal is to reach an optimal trade-off between degree 
and path length and, in particular, allow routing in f2(log N/ loglog N) with 
logarithmic degree. 

One can also improve the results by eliminating the deterministic require- 
ment. In fact, it is possible (see [8, 12]) to route greedily in 6>(log A^/ loglog A^) 
with logarithmic degree by using randomization and the so called neighhor-of- 
neighbor (NoN) approach: a node uses, at each step, its neighbor’s neighbors 
to make greedy decisions. It is not difficult to see that these techniques can be 
easily adapted to our scheme thus obtaining similar results. 

However, in this paper we focus on deterministic and uniform routing schemes. 
Among other advantages, they offer an optimal greedy routing strategy that 
provides simplicity, fault tolerance (as long as some node has edges toward des- 
tination, the routing succeeds) and locality (messages flow only on the portion 
of ring between source and destination), as noted in [12]. 

1.3 Organization of the Paper 

The rest of the paper is organized as follows. First of all, we provide the proof of 
the lower bound in Section 3. Then, in Section 4 we introduce the F-Chord(a) 
family and in Section 5 we prove some of its properties with regards to the 
degree, diameter, average number of hops, and congestion. Finally, in Section 6 
we conclude with some remarks and give some open problems. 

2 Fibonacci Numbers 

We, briefly, recall here some basic facts on Fibonacci numbers which will be used 
in the sequel (see [6]). Let Fib{i) denote the z-th Fibonacci number. They are 
defined as Fib{0) = 0,Fib{l) = 1 and, for each z>l. 
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Fib{i) = Fib{i — 1) + Fib{i — 2). 

For each index i, it holds 

Fib{i) = 

where 4> = 1.618... is the golden ratio and [ ] represents the nearest integer 
function. Furthermore, 

P 1 

Fib{i) ■ Fib{p — i) = ~{p[Fib{p + 1) + Fib{p — 1)] — Fib{p)}. 

i=o ^ 

3 The Lower Bound 

In this section, we furnish a tradeoff of 1.44 log on the sum of the degree 
and the diameter in any P2P network using uniform routing on N identifiers. 
Namely, we prove the following theorem. 

Theorem 1 Let N (S, d) denote the maximum number of consecutive identifiers 
obtainable trough a uniform algorithm using up to 5 jumps (i.e. degree S) and 
diameter d. For any i5 > 0, and d> 0, it holds that 

N{S,d) < Fib{S + d+l) (1) 

Remark: It can be shown that inequality (1) is strict whenever |(5 — d| > 1. On 
the other hand, we will exhibit a routing scheme that reaches equality for any 
choice of the parameters with |i5 — d| < 1- 

Proof. We proceed by induction on the sum 6 + d. Trivially, N{0, 0) = N (1, 0) = 
N(0, 1) = 1 = Fib{l) = Fib{2). Assume that (1) holds for any 6 and d with 
6 + d < X. We will show that for any degree y and diameter z with y + z = x 
it holds that N {y, z) < Fib{x + 1). We distinguish three cases on the number of 
times the first (i.e. the biggest) jump is repeated. We recall that the assumption 
of a uniform algorithm implies that jumps can be used to build paths in a greedy 
manner with respect to their size, that is, on each path jumps of the same size 
can be assumed to be consecutive. 

Case 1: the first jump appears at most once on each path. In this case, each 
path, either starting with the given jump or not, will never use this jump again. 
This implies that (cfr. Figure 1(a)) 

Ni{y,z) < N{y -l,z) + N{y - l,z - 1) 

< Fib{y + z) + Fib{y + z — 1) = Fib{y + z + 1) = Fib{x + 1). 

Case 2: the first jump appears at most twice on each path. Since we have two 
equal jumps, the size of the jump cannot exceed the maximum number of nodes 
reachable using degree y — 1 and diameter z — 1 remaining after the first jump 
has been used (cfr. Figure 1 (b)). The part remaining after the second jump 
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cannot exceed the maximum number of nodes reachable using the remaining 
degree y — 1 and diameter z — 2. Hence 

^ 2 (y, z) < 2N{y -l,z-l) + N{y-l,z-2) 

< 2Fib{y + z — 1) + Fib{y + z — 2) = Fib{x + 1). 

Case 3: the first jump appears at most £ times on each path for some 2 < £ < z. 
As in Case 2, we can deduce that (cfr. Figure 1 (c)) A^ 3 (y, z) < £N{y —l,z — £ + 
1) + JV(y — 1, z — £), which in turn can be shown to give 

Ns(y, z) < Fib{y + z + 1) = Fib{x + 1). 

From the above inequalities, we get 

fV(y, z) < max{fVi(y, z),N 2 {y, z),Ns{y, z)} < Fib{x + 1). 




N(y-1,z-2) 



Fig. 1. The three cases in the proof of Theorem 1. 



4 F-Chord: A Fibonacci-Based Chord-Like System 

The purpose of this section is to introduce our schemes that are based on the 
Fibonacci number system applied to a Chord-like network, namely a ring of N 
identifiers labelled from 0 to fV — 1. 

The idea of using a different base (other than 2) for representing IDs (and 
consequently to route among them) is not new. It was already pointed out in 
[15], in fact, that any base b < 2 can be used for a Chord-like routing that 
trades off routing table size (i.e. degree) with number of hops (i.e. diameter). As 
a result, the jumps would be F, for each i < logj N and, consequently. Chord’s 
results are slightly changed: the degree is raised to log^ N while the diameter is 
lowered to log{,/(-j_]^) N. By using a Chord-based scheme with base one could, 
therefore, improve on the diameter by paying off a corresponding increase in the 
degree. 

Nevertheless, by exploiting the many properties of Fibonacci numbers, we can 
define and analyze a family of uniform routing algorithms that offers improved 
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performances on the various parameters - number of jumps, diameter, average 
path length, and edge congestion. Each member of the family can be obtained 
by tuning the parameter a, with 1/2 < a < 1. In particular, for a = 1/2 the 
corresponding algorithm meets the bound in Theorem 1. 

In this section, we define the families by first defining the generic Chord 
extension to Fibonacci numbers mentioned above. Successively we present the 
two families of routing schemes F-Chord. 

4.1 The Fibonacci Routing Scheme Fib-Chord 

For sake of simplicity, we first introduce the Fibonacci routing scheme Fib-Chord. 
Let N G {Fib{m — 1), Fib{m)]. The scheme uses m — 2 jumps of size Fib{i), for 
i = 2, . . . ,m — 1. Recall that m « log,^ N = 1.44042 log IV. 

The results of next section will imply, in particular, that, by using Fib- 
Chord over a set of N identifiers, the degree (i.e., the number of jumps) is 
1.44042 logfV — 2, the diameter (i.e., the number of hops) is 0.72021 logfV, and 
the average path length is 0.39812 log fV. 



4.2 The Family of Routing Schemes F-Chord 

Intuitively, we obtain the two families of routing schemes by taking Fib-Chord 
and pruning them, either starting from smaller size jumps or from larger size 
jumps. The pruning is realized by eliminating a certain quantity (that is related 
to the parameter a) of the jumps with odd indices. More formally, our family 
is indeed composed by two subfamilies named F^-Chord and Fh-Chord defined 
below. 



Definition 1. Let Fib{m — 1) < N < Fib{m) and a G [1/2, 1]. 
a) The Fa-Chord(a) scheme uses the \a{m — 2)] jumps 

Fib{2i), for i = 1, . . . , [(1 — a){m — 2)J 



and 

Fib{i), for z = 2[(1 — a){m — 2)J -|- 2, . . . , m — 1. 
b) The Fh-Chord(a) scheme uses the \a{m — 2)] jumps 

Fib{i), for z = 2, . . . , TO — 2[(1 — a){m — 2)J 



and 



Fib{2i), for z 



TO — 2[(1 — a){m 
2 



2)J 



-k 1,..., L(m 



1)/2J. 



We will use the name F-Chord(a) whenever we want to indicate any of 
the two schemes Fa-Chord(a) and Fh-Chord(a). Notice here that Fib-Chord 
= F-Chord(l) (=Fa-Chord(l)=Ff,-Chord(l)); moreover, Fa-Chord(l/2) = F^- 
Chord(l/2). 
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5 Properties of F-Chord 

In this section we investigate and bound the main parameters of the F-Chord 
systems. 

For the rest of this section, we always assume that Fib{m — 1) < fV < Fib{m) 
and a G [1/2, 1]. 

5.1 Diameter 

Theorem 2 For any value of a, the diameter of F-Chord(a) is 
[m/2\ « 0.72021 log iV. 

Proof. Since we use the greedy strategy it is enough to consider the case a= 1/2, 

i.e., when there are only jumps of even Fibonacci index. 

We can prove, by induction, that after performing i jumps, the search interval 
is reduced to Fib{m — 2z -|- 1). In fact, if t = 1 then we are considering a jump of 
size greater or equal to Fib{m — 2), and since N < Fib{m) the search interval is 
bounded by Fib{m) — Fib{m — 2) = Fib{m — 1). Consider the situation after t 
jumps. If we took a jump in the previous step, the size of the search interval is 
limited by Fib{m — 2t+ 1). Hence, Fib{m — 2t+ 1) — Fib{m — 2t) = Fib{m — 
2t — 1). If we did not take a jump in the previous step, the search interval is 
limited by Fib{m — 2f). Performing the jump restricts the search interval to 
Fib{m — 2f) — Fib{m — 2t — 2) = Fib{m — 2t— 1). For i = [m/2j the interval is 
thus reduced to Fib{2) = 1 and the search ends. 

5.2 Average Path Length 

We can precisely evaluate the average path length of F-Chord(a). Let us, first, 
denote by Li{i,m) (resp. Li/ 2 {i,m)) the load for an edge of size Fib{i) on all 
intervals of size Fib{m) when we use F-Chord(l) (resp. the F-Chord(l/2)). 

The following properties can be easily obtained: 

1. Li{m, m) = 0; 

2. Li(m — l,m) = Fib{m — 2); 

3. Li{i,m) = Li{i,m — 1) + Li{i,m — 2),i < m — 2. 

Indeed, either the destination is one of the first Fib{m — 1) nodes or among the 
last Fib{m — 2) nodes in the interval. To go from the first part to the second part 
we issue a jump of size Fib(m — 1) yf Fib(i). That jump is hence not counted in 

Moreover, for L 1/2 it can be proved that: 

4- — 1, m) = Fib{m — 2); 

5. Li/ 2 (w — 2, m) = Fib\m — 1) -I- Fih{m — 3); 

6- Li/ 2 {i, m) = Li/ 2 (f, TO — 1) -I- Lij 2 {i, to — 2) for i < m — 2. 

We can prove the following lemma. 
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Lemma 1. By using properties 1-6, it holds that: 

1) Li(i, m) = Fib{i — l)Fib{m — i) for 1 < i < m. 

2) Lij2(2i, m) = Fib{2i — l)Fib{m — 2i) + Fib{2i + l)Fib{m — 2z — 1) for 2 < 
2i < m — 1. 

Theorem 3 The average path length of the F-Chord(a) scheme ,is bounded by 

0.398121og7V + (1 - a)0.248051og7V+ 1. 

Proof. We sketch the proof for Fa-Chord(a), computation for Ff,-Chord(a) is 
quite similar. For sake of simplicity, we assume that N = Fib(m). The sum of 
the total link load in the network, denoted by Sa(jn), equals the sum of the 
lengths of each path. It is easy to show that: 

m-2 L(l-a)(m-2)J 

Sa{m) = ^ Fib{i)Fib{m — z — 1) + ^ Fib{2i — l)Fib{m — 2z — 1) 

i=l i=l 



We have 

TO— 2 [(1 — a)(m— 2)J 

Sa{m) = ^ Fib{i)Fib{m — z — 1) + ^ Fib{2i — l)Fib{m — 2z — 1) 

i=l i=l 

= -y[(w — l){Fib{m) + Fib{m — 2)) — Fib{m — 1)] + 
o 

[(l-a)(m-2)J 

+ ^ Fib{2i — l)Fib{m — 2i — 1) 

i=l 

= — l){Fib{m) + Fib{m — 2)) — Fib{m — 1)] 

o 

[(1-q)(to-2)J 

+ ^ Fib{2i — l)Fib{m — 2i — 1) + Fib{m — a) + 2Fib{m — 5) 

i=3 



Hence, one can find the desired value of the average path length by observing 
that: 



Sg{m) 

Fib{m) 



1 


■ 


f 1 \ 


1' 


<5 


(m — 1) 


+ 
1 — I 





(1 — a){m — 2)Fib{5)Fib{m ~ 7) ^ ^ 
Fib(jn) 



The following are immediate corollaries of the above Theorems 2 and 3. 

Corollary 1 The F-Chord(l) scheme (=Fib-Chord) has degree 1.440421og — 

2, diameter equal to 0.72021 logfV, and average path length equal to 0.39812 log fV. 



Corollary 2 The F-Chord{l/2) scheme has degree and diameter both equal to 
0.72021 logfV and the average path length is 0.52215 logfV. 



We notice that F-Chord(l/2) meets the bound in Theorem 1. 

Corollary 3 For each < a S [0.58929,0.69424], the Fa-Chord(a) and F^-Chord 
(a) schemes improve on Chord in all parameters (number of jumps, diameter, 
and average path length). 
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5.3 Edge Congestion 

We remind the reader that, by using a uniform scheme, there is no node con- 
gestion. Therefore, here we focus on edge load, that is defined as the number of 
times it is used by all routes from every node to every other node. 

Analysis of Li(i,m) and Lij 2 {i,rn) shows that Li/ 2 (utn) is decreasing with 
i > 1, whereas Li(i,m) decreases until it reaches its minimum at t = [m/2], 
after which it is increasing. Hence, the maximum and the minimum loads are 
given by 

max{Li{i, m), Ti/ 2 (t, rn)} = Li/2(2, m) = Fib{m — 1) -I- Fib{m — 3) 

min{Li{i,m),Li/ 2 {i,Tn)} = Li(|'m/2] , m) = Fib{\m/2\ — l)Fib{\m/2\), 

and the ratio between the two is bounded by 9/4, which can be viewed as the 
worst possible congestion. On the other hand, in the literature congestion is 
sometimes defined as the maximum load divided by the average load. With this 
definition the congestion for the F-Chord(a) scheme lies between gi /2 9a 
gx, and the boundaries are swiftly calculated to gi /2 = 1.18034 and g\ = 1.38197. 

5.4 A Comment on Scalability 

In the case of F-Chord(l) and F-Chord(l/2), with n< N nodes in the network, 
results can be easily rewritten in terms of n with high probability, as already 
noticed in [15,16]. When choosing a parameter a G (1/2,1), the presence of 
n < N nodes will automatically tune (due to the fact that the shortest jumps 
will point to the same node): to aa{n) > a if one is using Fa-Chord(a); to 
ab{n) < a if one is using Fh-Chord(a). Thus it is possible to choose which 
member of the family is most suitable for the requirements of the application. If 
a lower average path length is desired one can choose Fa-Chord(a) (here, fixing 
a assures an upper bound of[a(m — 2)] on the node degree for large values of n) 
while if a low degree is the main goal one can choose Ff,-Chord(a) (in this case 
fixing Of assures an upper bound on the average path length as in Theorem 3). 

6 Conclusions and Open Problems 

We have described a family of simple algorithms that (1) improves uniform 
routing on Chord and (2) is of practical interest. In fact, the designer can choose 
which member of the family is most suitable for the requirements of the appli- 
cation. For example, in a distributed file-system application one may want to 
prefer lower average path length over worst case (and thus choose F-Chord(l)), 
while in an application where fast delivery is paramount, one would choose a 
faster worst case over the average (i.e. F-Chord(l/2)) with a whole range of 
intermediate choices by using appropriate values of a. 

Since any greedy routing requires l7(logfV) hops when the degree is logarith- 
mic [8], we believe that it is meaningful to improve the multiplicative constants 
in front of the logn since the results obtained by deterministic and uniform 
algorithms must be compared, practically, to the more theoretically appealing 
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0(log-/V/loglog A^) that, for some values of n could have performances compa- 
rable to the deterministic and uniform algorithms (that are much much simpler 
to realize and deploy). 

It eluded us to find an optimal average path length routing algorithm (once 
the node degree anf the uniformity requirements are fixed) . The search schemes 
used in F-Chord is close to optimal but not for all values of N. 
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Abstract. We consider a 2-edge connected, undirected graph G = 
{V, E), with n nodes and m non-negatively real weighted edges, and a sin- 
gle source shortest paths tree (SPT) T of G rooted at an arbitrary node 
r. If an edge in T is temporarily removed, it makes sense to reconnect the 
nodes disconnected from the root by adding a single non-tree edge, called 
a swap edge, instead of rebuilding a new optimal SPT from scratch. In 
the past, several optimality criteria have been considered to select a best 
possible swap edge. In this paper we focus on the most prominent one, 
that is the minimization of the average distance between the root and 
the disconnected nodes. To this respect, we present an 0(m log^ n) time 
and linear space algorithm to find a best swap edge for every edge of T, 
thus improving for m = o (n^/ log^ n) the previously known O(n^) time 
and space complexity algorithm. 

Keywords: Network Survivability, Single Source Shortest Paths Tree, 
Swap Algorithms. 



1 Introduction 

One of the most common operations in computer networks is the broadcasting 
of a message from a source node to every other node of the network. When the 
broadcasting procedure represents an important part of the network activity, 
it makes sense to use a topology that enables the information to reach all the 
nodes in the fastest possible way, that is a single source shortest paths tree (SPT) 
rooted in the source node. The SPT, as any tree-based network topology, may 
present some problems regarding link malfunctioning, however: the smaller is the 
number of links, the higher is the average traffic for each link and, consequently, 
the higher is the risk of a link overloading. Furthermore, the failure of a single 
link may cause the disconnection of a wide part of the network. 

Two different approaches can be followed to solve the problem of a link 
failure: either rebuilding a new SPT from scratch, or using a single non-tree 
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by the Italian Ministry of Education, University and Research. 
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edge (called a swap edge) to replace the failing link and reconnect the network, 
thus obtaining the so-called swap tree. The former guarantees the construction 
of a most efficient network on the graph without the failing link, but it is very 
expensive both in terms of set-up costs and of time complexity for computing a 
new SPT [8]. Indeed, a new SPT may be completely different from the existing 
one, and therefore the updating of a large amount of nodes may be necessary. 
Furthermore, the most efficient known algorithm for computing an SPT has an 
0{m + nlogn) time complexity, and therefore the precomputation of a new SPT 
for every possible link failure in the network requires an 0{mn + n? log n) time 
complexity. Thus, if the failing link is supposed to be quickly restored, and in 
any other case in which changes on the network structure are expensive, it may 
be preferable to use a swap edge, thus minimizing the number of nodes to be 
updated, rather than rebuilding a new SPT. 

Several different functions have been described in [5] to characterize a possi- 
ble best swap edge. In particular, among all the defined functions, the average 
distance from the root represents the most suitable choice to find a swap edge 
that matches the criterium on which the original SPT is based: reaching every 
node from the root by following a shortest possible path. It is worth noticing 
that by adopting such swap strategy, the average stretch factor of the swap tree 
w.r.t. a new SPT (i.e., the average ratio between the distance from the root 
to a disconnected node in the swap tree and in a new SPT) is bounded by 3 
[5]. Moreover, experimental results show that the tree obtained from the swap 
is functionally very close to a new SPT computed from scratch [6]. Therefore, 
swapping is not only faster than rebuilding, but it even gives a good network in 
terms of path lengths. 

In this paper we focus exactly on the problem of finding, for every edge of 
the SPT, a swap edge that minimizes the average distance from the root to 
every node in the disconnected area of the network. An O(n^) time and space 
complexity algorithm is known for this problem [5], while an 0{m + n) time 
and space one is described in [7] for the simpler problem of finding a best swap 
edge for a single failing link. In this paper we improve the former bound by 
presenting an O(mlog^n) time and linear space complexity algorithm, running 
in the standard RAM model. In this way, we obtain a better space complexity 
for every instance of the problem, and we reduce the time complexity for graphs 



The paper is organized as follows: in Section 2 we give some basic definitions 
that will be used throughout the paper; in Section 3 we present our algorithm; 
in Section 4 we show few results that will form the basis for the time complexity 
analysis of our algorithm; finally, in Section 5, we will formally analyze our 
algorithm. 

2 Basic Definitions 

Let G = {V, E) be a graph in which V is the set of nodes and E is the set of edges; 
every edge e = (u, v) has associated a non-negative real weight w{e), also denoted 



in which m = o 
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as w{u,v). A simple path is a sequence of nodes p = {ui,U 2 , ■ ■ .Uk) = u\ ^ Uk 
such that (ui, Ui+i) G E, i G [1, fc — 1], and Ui ^ Uj for every i,j G [1, k],i yf j; if 
Ml = Uk the path is called a cycle. Let w{p) = w{ui, Ui+i) be the length of 

the path p. The distance d{u, v) between the nodes u, v is defined as the length 
of a shortest path connecting the nodes. A connected graph is a graph in which 
every couple m, m of nodes has a path m — > m linking them; a graph is said to be 
2-edge connected if the deletion of any edge leaves the graph connected. 

A tree is a connected graph without any cycle. If we root a tree T at an 
arbitrary node r, then a node x is said to be an ancestor of a node y in T if the 
path from r to y in T contains x; in this case, y is a descendant of a;. A single 
source shortest paths tree (SPT) of G rooted in r is a tree T = (V, Et) made up 
by shortest paths linking r with every other node of the graph. Let Ty = (14, Ey) 
be the subtree of T rooted at m. In the rest of the paper, we will denote by |T„| 
the number of nodes in Ty, and by w{Ty) = d{v, x). The nearest common 

ancestor (NCA) of a given couple of nodes x, y is the lowest node of the tree 
that is an ancestor of both x and y (for details see [3]); we will indicate it by 
nca{x, y). 

Let e = (u, v) be any tree edge, with u parent of v in T . We indicate with 
Ce = {(a;, y) G E \ Et\ {x gV \ 14) t\ {y G 14)} the set of swap edges for e, i.e., 
the edges that may be used to replace e for maintaining the tree connected. In a 
2-edge connected graph, we have that Ce yf 0, Ve G Et- Given a non-tree edge 
/, let life{f) = {e G Ex\f G Ce}, i.e., the set of tree edges for which / is a swap 
edge; we will indicate life{a) n life{P) by writing life{a,P). For any / G Ce, let 
and df{x,y) be the tree obtained by swapping e with / and the distance 
between nodes x,y in ?}//) respectively, and let F{e,f) = '^/(c^;). Let 

4>{e,f) = F{e, f)/\Ty\ denote the average distance between r and nodes in Ty. 
Notice that |T„| does not depend on the swap edge, thus finding a swap edge 
minimizing (j) is equivalent to finding a swap edge minimizing F. Hence, a best 
swap edge f* for a tree edge e is defined as 

/* = argmin{F(e, f)\f G Ce}. 

In the sequel, we will study the problem of finding, for every edge of the SPT, a 
best swap edge minimizing the function F, and then minimizing the function cj). 



3 High-Level Description of the Algorithm 

Let us first recall the definition of lower envelope of a set of functions: 

Definition 1. Let T = |/i(x), . . . , fk{x)} be a set of functions, where fi : x G 
Di C K I— > K. The function 

k 

Ljr : X G Djr = [J Ci I— !■ min {fi{x) \ i G [l,k] Ax G Di} G K 

i=l 



is named the lower envelope of T . 
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Let us consider the nodes of the tree as ordered in any arbitrary post-order. 
For every tree edge e = {u,v), and for every non-tree edge a, let the swap 
function associated with a be defined as follows: 

r f if O € Ce—{u^v)-! 

[undefined otherwise. 

If we arrange the nodes on the x axis and the values of the fa functions on 
the y axis, then solving our problem reduces to finding the lower envelope of 
IF = {/o,(u) I q; is a non-tree edge}. 

In [4] it was presented an efficient comparison-based algorithm to compute 
the lower envelope of a set of continuous real functions. Such algorithm relies 
on the concept of intersection between functions, and can be extended to our 
case by introducing the notion of inversion between pairs of swap functions. 
Basically, an inversion between two swap functions fa,f /3 with life{a,P) yf 0, is 
defined by a couple of tree edges e = {u, v), e! = (u, z) € life{a, f3) such that u is 
the parent of v and v is the parent of z in T, and for which the following holds: 
either /„(z) < fpiz) and /„(u) > fp{v), or /„(z) > fp{z) and fa{v) < fp{v). 
Node V is said to be an inversion node for a, (3. 

The algorithm provided in [4] relies on the efficient answering to a set of three 
queries that, for our problem, can be rephrased as follows: 

Qi: Given a swap function /„, return the minimum and the maximum node (with 
respect to the selected post-order numbering) for which /„ is defined; 

02 : Given a couple of swap functions and a node v G V, return their minimum 
inversion node greater than v (w.r.t. the current post-order), if any; 

03 : Given a couple of swap functions fa, f /3 and a node v belonging to the path 
induced by life{a,(i), compare /a(u) and f/ 3 {v). 

In the following section, we show how to answer efficiently to these queries, 
and we provide a bound on the number of inversions of a couple of functions 
fa,fp- 



4 Answering the Queries 

Given two swap edges a = (ya, Xa),/3 = {yp, xp), observe that either life{a, (3) is 
empty, or it consists of a set of edges which form a path in T. In the latter case, 
such a path consists either of: (i) a subpath of a root-leaf path in T, or (ii) two 
edge-disjoint subpaths of T which start from the node nca{ya, Xa) = nca{yp, xp) 
and proceed downwards. Notice that case (ii) can be easily reduced to case (i) 
(it suffices to transform the swap edge a into edges a' = {ya,nca{ya,Xa)) with 
w(a') = w(a) + d(r,Xa), and a" = (xa,nca(ya,Xa)) with w{a") = w{a) + 
d{r,ya), and similarly for (3). Hence, in the following we will assume, for the 
sake of simplicity, that life{a, (3) = {cq, . . . , Cp}, where precedes e^+i along a 
root-leaf path of T, i = 0, . . . ,p — 1. 

Trivially, query 0i can be answered in constant time, since the nodes we are 
searching for coincide with the endnodes of the given swap edge. Goncerning the 



Swapping a Failing Edge of a Shortest Paths Tree 103 



query Q 2 , performing it efficiently is harder. In the following, given two swap 
edges Of,/3, if the lower endpoint of life{a,l3) coincides with any endvertex of a 
or (3, then we say the two edges are related, unrelated otherwise. Next, we show 
separately how to manage related and unrelated swap edges. 



4.1 Related Swap Edges 

We start by proving two lemmas concerned with related edges: 

Lemma 1. Let a = {ya,Xa),(3 = {yp,xi 3 ) he related swap edges such that 
life{a,(3) = {eo,...,6p = (up_i,Up)}, where xp = Vp is an ancestor of Xa- 
If there exists an edge eu € life{cx, /3) such that F{ek,P) < F{ek,a), then 
dp{r,xp) < da{r,xp). 

Proof. In the following, we adopt the following notation (see Figure 1). Let 
{Cp+i, . . . , Cp+g} be the set of edges in life{a) which lie below Cp. Let Vi be 
the endvertex of farthest from the root, and for i = 0, . . . ,p + q — 1, let 
Si = T„. — i.e., the subtree of Ty. induced by the removal of all the nodes 

in Finally, we set Sp+q = Ty^^^. Then 



p+q 



E(e,,/3)=^u;(^.) + ^ dpijr, xp) + E IS.I+ 



i—k 



i—k 



^=2 + 1 



P+q 



+ XI dp{r,xp)+ ^ w{ei) |S'i|; 



i=p+l 



i=p+l 



p+q 



P+q 



q:) — u'(iS'i) + I dct(?', ;Cq,) + w(e£) + E »(«) 1SI+ 



i—k 



i—k 



i=p+l 



t.=i+l 



P+q 



+ X da(r,Xa)+ ^ w{ei) |S'i|. 



i=p+l 



t.=p+q 



Recalling that, by definition, da{r, xp) = da{r, Xa)+Y+t=p+i and assuming 

by contradiction that dp{r,xp) > da{r,xp), we have 



p+q 



p+q 



Fifik, P) > ^ ^ Ul[Si) + ^ ^ I da (r , Xa) + ^ ^ w{e£) + ^ w{ei) I \S^\+ 



i—k 



i—k 



P+q 

+ X 



i=p+l 



p+q 



da{r,Xa)+ X ++F X 

l=p+l t=p+l 



|S'i| > F{ek,a), 



thus contradicting the assumptions. 



□ 
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Fig. 1. Non-tree edges a, f3, where the endnode Xa of a is a descendant of the endnode 
Xj 3 of /3; edges a, f) are both swap edges for the tree edges {eo, . • . , Cp}. 



The following result derives from Lemma 1: 

Lemma 2. Under the same assumptions of the previous lemma, we have that 
F(ej,P) < F{ej,a),\/j = 1. 

Proof. We have 

k—1 k—1 / p \ 

(3') , /?) F ^ ^ 4 - ^ ^ ( dj^iv^ ^ 0 ) F Y. «-te) |S.I; 

i—j i=j \ / 



k—1 k—1 / p \ 

F{ej,a) = F{ek,a) + dair, Xfs) + E “-(«) isi- 

i—j i—j \ £—i-\-l / 

From the assumptions, we have that F{ek, f3) < F{ek, a). Moreover, from Lemma 
1 we know that dp{r,xi 3 ) < da{r,xp), thus obtaining F{ej,/3) < F{ej,a). □ 

This result introduces an important monotonicity property: if an edge (3 gives 
a better result than a lower one a in swapping an edge e^, then (3 will always 
be better than a for every edge above Cfc. 

The next lemma focuses on the case in which the lower swap edge a gives ini- 
tially a better result than the higher one (3, and provides a condition to establish 
for which failing edge, if any, (3 will became preferable w.r.t. a while climbing 
up the tree. 

Lemma 3. Let a = {ya,Xa),l3 = {yp,xi 3 ) be related swap edges sueh that 
life{a,(3) = {eo,...,ep = (vp-i,Vp)}, where xp = Vp is an ancestor of Xa- 
Let p = xp ^ Xa and let F{ep,a) < F(ep,P). Then, there exists k € [0,p) such 
that F{ek,f3) < F{ek,a) if and only if 



p-i 



i—k 



F{ep,(3) - F{ep,a) 
da(r, Xa) + w(p) - dp(r, xp) ' 



( 1 ) 
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Proof. We write F{ek,P) and F{ek,a) by using the same technique used above: 



p-i p-i / p \ 

F(efc,/3) = F(ep,/3)+^u;(^0 + E dffir, Xf}) + E “(«) lai; (2) 

i—k i—k \ l—i+1 / 



p-1 p-1 / V \ 

F{ek,a) = F{ep,a)+'^w{Si)+'^ ida{r,Xa) + w{p) + E w(e«) j |S'i|. (3) 

i—k i—k V £—i-\-l / 

From (3) and (2), the condition F{ek,P) < F{ek,a) can be rewritten as: 

p—1 

F{ep,l3) + dfj{r,Xf3) E \Si\ < F{ep,a) + {da{r,Xa) + w(p)) E 



i—k 



i—k 



which is equivalent to (1), thus concluding the proof. 



□ 



4.2 Unrelated Swap Edges 

Next, we prove two results concerned with unrelated swap edges. 

Lemma 4. Let a = {ya,Xa),/3 = (yi 3 ,Xjs) be unrelated swap edges sueh that 
life{a,P) = {eo,...,6p = (up_i,fp)}, where both Xa and xp descend from Vp 
in T. If F{ep,a) < F{ep,(3) and F{ek,ff) < F{ek,a) for some k G [0,p), then 
F{ej,l3) < F{ej,a), Vj = 0, . . . , fc - 1. 

Proof. Let Xq, Xb be the children of Vp that are ancestors of Xa,Xfs in T, respec- 
tively; let a = (vp,Xa), b = (vp,Xb), and let p = Xa ^ Xa,p = Xb ^ xp (see 
Figure 2). We start by writing the values of F{ek,a) and F{ek,P). 




Fig. 2. Unrelated swap edges a,/3 and the edges {eo,...,6p} = Ufe{a,f3) that they 
both can swap (splines denote paths). 
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p-i 



F{ek, a) = F{ep, a) + ^ w{S^) 



i—k 



p-1 



+ ^ fia(r,a;a)+w(p)+u>(a)+ ^ w(e^) MS’*!; 



i—k 






p-1 



F{ek,P) = F{ep, /3) + ^ w{S,) 



i—k 



p-1 



+ '^\d/3{r,Xf3) + w{^j,)+w{b)+ ^ w(e^) |5*|. 



i—k 



i^i-\-l 



From the assumptions F{ek,P) < F{ek, a)and F{ep, a) < F{ep, /3), we have that 
the above two equations imply 



w{b) + djs{r, Xjs) + w(/i) < w{a) + da{r, Xa) + w{p). (4) 

Then, the cost of swapping a generic edge e^-, j € [0, fc), with a and (3 is: 

C dair, Xa) + w{p) + w{a) + 

k-l 
i^j 

dp(r, xp) + w{p) + w{b) + 

i=j \ t. 



F(e„/3) = F(efc,/3) + ^ 



k 

E 

=1+1 



w{et) |S'i| 



F{ej,a) = F{ek,a) + ^ 



k 

E 

=1+1 



w{ei) |S'i| 



k-l 

i=3 

We know from the assumptions that F{ek,(3) < F{ck,a), thus we can conclude 
the proof by using (4) . □ 



The next result deals with the same situation of Lemma 4, giving us a con- 
dition to identify, if any, an edge in which (3 gives a better result than a. 

Lemma 5. Let a = (ya,Xa),/3 = (yp,xp) be unrelated swap edges such that 
life{a,l3) = {eo,...,6p = (up_i,Up)}, where both Xa and xp descend from Vp 
in T. If F{cp, a) < F{cp,P), then there exists an edge Ck € life{a,P) such that 
F{ek,(3) < F{ck,a) if and only if 



p-1 

E 

i—k 



l^.l > 



F{ep,(3) - F{ep,a) 



da{r, Xa) + w{p) + w{a) — dp{r, xp) — w{p) — w{b) 



(5) 
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Proof. We can rewrite the condition F(ek,P) < F{ek,ce) in the following way, 
by using a technique very similar to that used above: 

p-i 

F{ep, 13) + (df 3 {r, x/ 3 ) + w{fi) + w{b)^ X] < 

i—k 

p-1 

F{ep, a) + {da{r, Xa) + w(p) + w(a)^ X 

i—k 

which is equivalent to condition (5), thus concluding the proof. □ 



4.3 Answering to Queries Q 2 and Q 3 

The results of Lemmas 1-5 can be summarized by the following: 

Proposition 1. Let a, (3 he swap edges such that life{a,P) = { cq , . . . ,6p}. If 
F(ep,a) < F(ep,(3), then one of the following two conditions must hold: 

1 . F(ei,a) < F(ei,/3) Vz = 0, . . . ,p- 1; 

2. 3j e [0,p)|(^Vz e (j,p], F{e^,a) < F{ei,f3)^ A (yi € F{e^,a) > 

F{e,,p)). □ 

Hence, the following two cases are possible: 



1. If the edges are related, then from Lemma 3 the inversion node if it exists, 
must satisfy the following condition: 






F{ep,(3) - F{ep,a) 
da{r,Vj) + w{p) - dp{r,Vj) ' 



( 6 ) 



2. Otherwise, if the edges are unrelated, then from Lemma 5 the following must 
hold to ensure the existence of the inversion node vf 



\Fvj I > \Tv, 



F{ep,P) - F{ep,a) 



da{r, Xa) + w(a) + w(p) — w(b) — dp(r, xp) — w{p) 



( 7 ) 



Thus, finding the inversion node reduces to searching for the lowest node with 
a certain number of descendants on a given path. Concerning the evaluation of 
conditions (6) and (7), in [7] it has been proved the following: Let e = {u,v) be 
a tree edge and / = {x,y) G C^, and let w{T,x) {w{T,v), respectively) denote 
the sum of the lengths of all the paths in T starting from x {y, respectively) and 
leading to every node in T; then, the function F(e, f) can be written in this way: 



F{e, f) = w{T, x) - w{T, v) + w(T„) - d{v, x) • (|T| - |T„|) -f |T„| • (d(r, y) + w{f)) . 

Terms |T„| and d{r,y) can be easily obtained in constant time, after a linear 
time preprocessing of T. Moreover, in [7] it is also proved that it is possible to 
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compute w{T,x) for every node x G V in 0(n) time. From this, it follows that 
we can compute F{ep, a) and F{ep, f3) in constant time. Since all other terms in 
(6) and (7) are available in 0(1) time after linear time preprocessing, it follows 
that these two conditions can be evaluated in 0(1) time. 

The above analysis shows that to establish the query time for Q2, it remains 
to bound the number of times conditions (6) and (7) need to be tested to find 
their inversion node. We can prove the following result: 

Lemma 6. (Query time for Q 2 ) Let a = {ya,Xa),f3 = {yis^xp) he swap edges 
such that life{a,l3) = {eo,...,ep = (fp_i,z;p)}, where both Xa and xp descend 
from Vp in T. Then, it is possible to compute their inversion node, if any, in 
0(log n) time and linear space, after linear time preprocessing. 

Proof. Fixed any node Vj in life{a,(3), we can evaluate conditions (6) and (7) in 
0(1) time. Hence, we have to search for the lowest node in life{a, f3) with the 
needed number of descendants. 

First of all, we compute the endpoints of life{a,P), which can be easily ex- 
pressed in terms of NCA queries, by noticing that the lower endpoint is given 
by nca{xa,xp), while the higher one is the lowest node among nca{xa,ya) and 
nca{xp,yp). In this way, we also realize whether a, (3 are related or not. In [3] it 
is showed a technique to find the NCA of a pair of nodes in constant time, after 
linear time preprocessing. Thus, it is possible to find the endnodes of life {a, (3) 
in 0(1) time, after linear time preprocessing. Afterwards, we execute a binary 
search over the path life{a, (3) in the following way: we first jump up to node vq 
and we check if it satisfies the appropriate condition; if not, there is no inver- 
sion node, and we are done. Otherwise, we have to search in the path vg Vp. 
To this aim, we jump up to the node Up/2, and we verify if it satisfies the ap- 
propriate condition: if yes, we continue to search in Vp /2 Vp-, otherwise we 
look at vq — > Vp/ 2 - By proceeding in this way, we can find the inversion node 
in O(logn) steps. In [2] it is illustrated a technique to jump up an arbitrary 
number of nodes in a tree in 0(1) time, after linear time preprocessing, hence 
we can find the inversion node in O(logn) time and linear space, after linear 
time preprocessing. □ 

On the other hand, concerning the query time for Q3, we can prove the 
following: 

Lemma 7. (Query time for Q3) Let fa, fp be the swap functions of the swap 
edges a, (3, and let life{a, (3) = {eg , . . . , Cp}. Then, it is possible to compare fa(vi) 
and fp{vi), i G [0,p], in O(logn) time, after linear time preprocessing. 

Proof. We evaluate the functions in Vp, and we compute the inversion node 
Vj of fa and fp, if any, in O(logn) time, as shown in Lemma 6. W.l.o.g., let 
fa(vp) < fp(vp). From Proposition 1, we know that fa, fp invert in at most one 
point, hence fa{vi) < fp{vi) i<j. □ 
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5 Analysis of the Algorithm 

To analyze our algorithm, we start by recalling some important results about 
the relationships between Davenport-Schinzel sequences and the lower envelope 
sequence of a set of functions (for details see [1] and [4]): 

Definition 2. Let k, s G N'*'. A sequence U = (mi, . . . Up) of integers Ui G [1, k] 
is a Davenport-Schinzel sequence of order s with k elements, say DS(k,s), if 
Ui yf Ui+i,\/i < p, and there exist no s + 2 indexes 1 < h < • • • < is +2 < P such 
that 

ttq = Ui^ = Ui^ = • • • = = a, Ui 2 — Ui^ = Ui^ = • • • a ^ b. 

The complexity of a DS{k,s) is defined as Xs{k) = max{|C/| | U G DS{k,s)}. 

Definition 3. Let T = {fi{x ), . . . , fk(x)} be a set of functions, and let Lj^{x) 
denote the lower envelope of T . Let p be the minimum number of intervals 
Ii, . . . , Ip in which Djr (the domain of Ljr[x) ) can be partitioned and such that 

Vj G [l,p] I Ljr{x) = fij{x), \/x G Ij. 

Then, the lower envelope sequence of T is the sequence CTf = (h, ■ ■ • ,*p)- 

We are now ready to prove the main result: 

Theorem 1. Given a 2-edge connected, undirected graph G = (V,E), with n 
nodes and m non-negatively real weighted edges, and given a SPT T of G, the 
problem of finding, for every tree edge, a best swap edge which minimizes the 
average distance between the root ofT and the disconnected nodes, can be solved 
on a standard RAM model in 0(m log^ n) time and linear space complexity. 

Proof. Let T he & set of k functions intersecting pairwise in at most s points. 
Then, the lower envelope sequence of IF is a DS{k, s-|-2) [1]. Moreover, the lower 
envelope of T can be computed by performing 0(Ag+i (fc) log k) comparisons and 
by using linear space [4] . 

In our case, T = {fa{v) | a is a non-tree edge}. Since we have k = m — n 1 
swap functions, and given that from Proposition 1 every pair of swap functions 
has at most s = 1 inversions, it turns out that to compute the lower envelope of 
T we need 0(A2(m)logm) comparisons. Since \ 2 {m) = 2m -|- 1 [1], this means 
we need 0(m log m) comparisons. In every comparison, we have to answer any 
of the queries described in the previous section, and so from Lemmas 6 and 7, 
we know that every query can be answered in O(logn) time after linear time 
preprocessing. It follows that the algorithm has O(mlog^n) time complexity. 
Both the algorithm and the precomputations use linear space, and then the 
space complexity is linear. □ 
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Abstract. A black hole is a harmful host that destroys incoming agents 
without leaving any observable trace of such a destruction. The black 
hole search problem is to unambiguously determine the location of the 
black hole. A team of agents, provided with a network map and executing 
the same protocol, solves the problem if at least one agent survives and, 
within finite time, knows the location of the black hole. 

It is known that a team must have at least two agents. Interestingly, two 
agents with a map of the network can locate the black hole with 0{n) 
moves in many highly regular networks; however the protocols used apply 
only to a narrow class of networks. On the other hand, any universal 
solution protocol must use f?(nlogn) moves in the worst case, regardless 
of the size of the team. 

A universal solution protocol has been recently presented that uses a 
team of just two agents with a map of the network, and locates a black 
hole in at most 0(n log n) moves. Thus, this protocol has both optimal 
size and worst-case-optimal cost. We show that this result, far from clos- 
ing the research quest, can be significantly improved. 

In this paper we present a universal protocol that allows a team of 
two agents with a network map to locate the black hole using at most 
0{n-\- dlogd) moves, where d is the diameter of the network. This means 
that, without losing its universality and without violating the worst-case 
l7(nlogn) lower bound, this algorithm allows two agents to locate a 
black hole with 0{n) cost in a very large class of (possibly unstructured) 
networks. 

Keywords: Distributed Algorithms, Distributed Computing, Mobile 
Agents, Harmful Host, Undetectable Failure, Size and Cost Optimal Pro- 
tocols 



1 Introduction 

In networked systems that support autonomous mobile agents, a main concern 
is how to develop efficient agent-based system protocols; that is, to design pro- 
tocols that will allow a team of identical simple agents to cooperatively perform 
(possibly complex) system tasks. Example of basic tasks are makeup, traversal, 
rendez-vous, election. The coordination of the agents necessary to perform these 
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tasks is not necessarily simple or easy to achieve. In fact, the computational 
problems related to these operations are definitely non trivial, and a great deal 
of theoretical research is devoted to the study of conditions for the solvability of 
these problems and to the discovery of efficient algorithmic solutions; e.g., see 
[1-8,13], 

At an abstract level, these environments can be described as a collection 
E of autonomous mobile agents (or robots) located in a graph G. The agents 
have computing capabilities and bounded storage, execute the same protocol, 
and can move from node to neighboring node. They are asynchronous^ in the 
sense that every action they perform (computing, moving, etc.) takes a finite 
but otherwise unpredictable amount of time. Each node of the network, also 
called host, provides a storage area called whiteboard for incoming agents to 
communicate and compute, and its access is held in fair mutual exclusion. The 
research concern is on determining what tasks can be performed by such entities, 
under what conditions, and at what cost. 

At a practical level, in these environments, security is the most pressing 
concern, and possibly the most difficult to address. Actually, even the most 
basic security issues, in spite of their practical urgency and of the amount of 
effort, must still be effectively addressed (for a survey, see [14]). 

Among the severe security threats faced in these environments, a particu- 
larly troublesome one is a harmful host; that is, the presence at a network site of 
harmful stationary processes. This threat is acute not only in unregulated non- 
cooperative settings, such as Internet, but also in environments with regulated 
access and where agents cooperate towards common goals. In fact, a local (hard- 
ware or software) failure might render a host harmful. The problem posed by 
the presence of a harmful host has been intensively studied from a software engi- 
neering point of view (e.g., see [12, 15, 16]), and recently also from an algorithmic 
prospective [9-11]. 

Obviously, the first step in any solution to such a problem must be to identify, 
if possible, the harmful host; i.e., to determine and report its location. Following 
this phase, a “rescue” activity would conceivably be initiated to deal with the 
destructive process resident there. Depending on the nature of the danger, the 
task to identify the harmful host might be difficult, if not impossible, to perform. 

A particularly harmful host is a black hole: a host that disposes of visiting 
agents upon their arrival, leaving no observable trace of such a destruction. The 
task is to unambiguously determine and report the location of the black hole, 
and will be called black hole search. The searching agents start from the same 
safe site and follow the same set of rules; the task is successfully completed if, 
within finite time, at least one agent survives and knows the location of the 
black hole. Note that this type of highly harmful host is not rare; for example, 
in asynchronous networks, the undetectable crash failure of a site will transform 
that site into a black hole. 

Black hole search is a non trivial problem, its difficulty aggravated by the 
combination of absence of any trace of destruction (outside the black hole) and 
asynchrony of the agents. It has been investigated focusing on identifying con- 
ditions for its solvability and determining the smallest number of agents needed 
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for its solution [9-11]. Some conditions are very simple; for example, the graph 
G must be biconnected, the team must consist of at least two agents, and n must 
be known. A complete characterization has been provided for ring networks [10]. 
Our interest is in universal (or generic) solution protocols, i.e. protocols that can 
be used in every biconnected network. 

The efficiency of a solution protocol is measured first and foremost by the 
number of agents used by the solution. This value, called size, depends on many 
factors, including the topology of the network, the amount of a priori information 
the agents have about the network, etc. In particular, in an arbitrary network, 
if the topology of the network is known, two agents suffice ! [11]. Indeed, this 
surprising result can be achieved by several protocols. The second efficiency 
measure is the number of moves, called cost, performed by the agents. Clearly 
the research interest is in the design of size-optimal universal solutions (i.e., using 
a team of just two agents) that are also cost-efficient. 

Sometimes the network has special properties that can be exploited to obtain 
a lower cost network-specific protocol. For example, two agents can locate a black 
hole with only 0(n) moves in a variety of highly structured interconnection 
networks such as hypercubes, square tori and meshes, wrapped butterflies, star 
graphs [9]. On the other hand, there are networks where l?(nlogn) moves are 
required regardless of the number of agents; such is for example the case of ring 
networks [10]. The lower bound for rings implies an l7(nlogn) lower bound on 
the worst case cost complexity of any universal protocol. 

Indeed, a universal solution protocol has been recently presented that has 
both optimal size and worst-case-optimal cost. In fact, it uses a team of just two 
agents with a map of the network, and locates a black hole in at most 0(n log n) 
moves [11]. Surprisingly, this result does not close the research quest. 

In this paper, we show that it is possible to considerably improve the bound 
on cost without increasing the team size. In fact, we present a universal protocol. 
Explore and Bypass, that allows a team of two agents with a map of the network 
to locate a black hole with cost 0{n -\- dlogd), where d denotes the diameter of 
the network. 

This means that, without losing its universality and without violating the 
worst-case l7(nlogn) lower bound, this algorithm allows two agents to locate 
a black hole with 0{n) cost in a very large class of (possibly unstructured) 
networks: those where d = 0(n/logn). 

Importantly, there are many networks with 0(n/logn) diameter in which 
the existing protocols [9, 11] fail to achieve the 0(n) bound. A simple example 
of such a network is the wheel, a ring with a central node connected to all ring 
nodes, where the central node is very slow: those protocols will require 0(n log n) 
moves. 

2 Definitions and Terminology 

2.1 Framework 

Let G = (V,E) be a simple biconnected graph; let n = [H | be the size of G and 
d be its diameter. At each node x, there is a distinct label (called port number) 
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associated to each of its incident links (or ports); let Xx{x,z) denote the label 
associated at x to the link (x, z) G E, and A^; denote the overall injective mapping 
at X. The set A = {Xx\x G F} of those mappings is called a labelling and we 
shall denote by (G, A) the resulting edge-labelled graph. The nodes of G can be 
anonymous (i.e., without unique names). 

Operating in (G, A) is a set A of distinct autonomous mobile agents. The 
agents can move from node to neighbouring node in G, have computing capa- 
bilities and bounded computational storage, obey the same set of behavioral 
rules (the protocol). The agents are asynchronous in the sense that every action 
they perform (computing, moving, etc) takes a finite but otherwise unpredictable 
amount of time. Initially, the agents are in the same node h, called home base. 
Each agent has a map of the labelled graph (G, A) where the location of the 
home base is indicated. 

Each node has a bounded amount of storage, called whiteboard; O(logn) bits 
suffice for all our algorithms. Agents communicate by reading from and writing 
on the whiteboards; access to a whiteboard is gained fairly in mutual exclusion. 

2.2 Black Hole Search 

A black hole is a node where resides a stationary process that destroys any agent 
arriving at that node; no observable trace of such a destruction will be evident 
to the other agents. The location of the black hole is unknown to the agents. 
The Black Hole Search problem is to find the location of the black hole. More 
precisely, the problem is solved if at least one agent survives, and all surviving 
agents know the location of the black hole. 

The main measure of complexity of a solution protocol V is the number of 
agents used to locate the black hole, called the size of V. Clearly, at least two 
agents are needed to locate the black hole. On the other hand, two agents with 
a map of the network suffice to locate the black hole. 

The other measure of complexity of a protocol V is the total number of moves 
performed by the agents, called the cost of V. 

2.3 Exploration 

Let T be a spanning-tree of G rooted in the home base of the agents. For each 
node V we define T„ to be the subtree of T rooted at v. We will slightly abuse 
the notation and use to mean both the subtree rooted at v and the set of 
the nodes of this subtree. For a given we define the components of T„ to be 
the connected components of the graph induced in G by the nodes of \ {u}. 
Clearly, each component of T„ is a union of one or more of its subtrees (plus 
the connecting edges). As usual, l^l means the size of the set S (we will almost 
exclusively talk about sets of nodes, sometimes about set of edges) . For a given 
node V the level of v is defined as its distance (in T) from the root. Given two 
nodes v and w, denotes the unique directed path from u to w in T. For a 
given path tt we denote by the reversal of this path. Given a node v and 
a set S' = {ei, 62, . . . , Cfe} of edges of T, we denote by G*' \ S the connected 
component containing v obtained from T by removing the edges in S. 
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We say that a node is unexplored if it has not been visited by an agent. A 
node V is explored if all the nodes of have been visited. A node v is safe if it 
has been visited, but there are unexplorednodes in T„. 

Similarly, each port can be classified as: (1) unexplored - no agent has yet 
arrived/departed via that port; (2) dangerous - an agent has left via this port, 
but none has arrived from there; (3) used - an agent arrived via that port and 
(4) explored - the port is used and all nodes in the corresponding subtree has 
been explored. Obviously, a used port does not lead to black hole; on the other 
hand, both unexplored and dangerous ports might lead to it. 

To ensure that at least one agent survives, we will not allow an agent to 
leave through a dangerous port. To prevent the execution from stalling, we will 
require any dangerous port not leading to the black hole to be made used as soon 
as possible. This is accomplished using the following technique called Cautious 
Walk [9-11]: 

Cautious Walk 

Whenever an agent a leaves a node u through an unexplored port p (transforming 
it into dangerous) leading to a node u, upon its arrival and before proceeding 
somewhere else, a returns to u (transforming that port into used). Node u will 
be called the last safe node of a until a is back in u to make the port used. 

If agent b reaches the last safe node m of a (i.e., a has not returned yet from v), b 
will perceive a as being blocked on (u, v) (or simply, blocked at v); this perception 
will persist until 6 becomes aware that a has indeed returned from v. 

3 Algorithm Explore and Bypass 

The proposed algorithm Explore and Bypass ( SSzB ) is a rather complex protocol 
that uses several quite different techniques and strategies; it allows a team of 
just two agents with a map of the network to solve the Black Hole Location 
using at most 0{n + dlogd) moves. 

In the following we will first describe the overall strategy. The structure of the 
main modules and the algorithmic techniques they employ are described next. 
The full description can be found in the appendix, as the handling of numerous 
cases does not fit in the limited space available. The overall structure and the 
major modules of Algorithm are captured in Fig. 1. 

In assessing the overall cost, our goal will be to charge the moves caused 
by each module to the nodes explored during the execution of that module. As 
most of the activities consist of traversals of unexplored nodes (and possibly a 
second traversal of newly explored nodes), this works quite well. The activities 
that cannot be easily accounted for in this way are counted as overhead and the 
cost is charged to a higher level module. 

3.1 Overall Strategy 

In Algorithm £hB , the two agents, a and b, cooperatively explore the network to 
locate the black hole. The exploration is achieved by the two agents performing 
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Fig. 1. The overall structure of the Algorithm £SzB. 



mainly a cooperative depth-first traversal of a spanning-tree T of the network, 
using the other links as bypasses if needed. Special traversal techniques are 
employed in some particular cases to avoid an unnecessary increase in the overall 
cost of the protocol. The tree T used is a breadth-first spanning-tree rooted in 
the home base of the agents; this choice is made for efficiency reasons that will 
become apparent later. 

The algorithm starts with both agents cooperatively exploring the tree T 
(module CoExplore) without passing through non-T edges. Eventually, no more 
nodes can be explored without using non-T edges: an agent a arrives at a node 
u and discovers that the only non-explored edge is a dangerous edge (u, v) (thus, 
b is blocked at v). When this occurs, there are several possible actions that a 
may take. 

The decision on which action a will take (procedure MainDispatch()) depends 
on the structure of the set U of still unexplored nodes: (1) If C/ is small (|C/| < d), 
a will execute procedure ExploreSmall(). (2) If the largest unexplored subtree 
Tjn hanging from v is not too big {\U \ Tm\ > d), a will execute procedure 
BypassNode(); otherwise (3) a will execute procedure BypassHardNode(). Agent 
a informs b about the decided action by leaving a message in the whiteboard 
at u. 

If agent b returns to u, it will discover a’s decision, and coordinate its ac- 
tivity with a. The goal is to re-establish a situation similar to the initial one, 
where both agents cooperatively explore a single (sub)tree (and eventually one 
of them perceives the other as blocked on a link) . Although the goal might not be 
achieved, the agents will always return to a configuration when MainDispatch() 
can be applied again: 

Property UVW: An agent (say, b) is blocked on (u,v), and the other agent 
(say, a) is in u. Either the only unexplored nodes are the nodes of Ty, or there 
exists a neighbour w of v such that the unexplored nodes are the nodes of Ty \ Ty, 
(if u is a child of u and w is a child of v) or Ty \ Ty (if m is a child of v and v is 
a child of w)). 

Summarizing, the two agents perform a cooperative depth-first traversal of a 
breadth-first spanning-tree of the network, until an agent finds the other blocked 
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on a link and all unexplored nodes are on the other side of that link. In this case, 
the agent will either execute ExploreSmall() (a procedure that leads the protocol 
to terminate without any other call to any other procedure), or will bypass the 
blocked agent using either BypassNode() or BypassHardNode(); every time the 
latter cases occur, the level (i.e., the distance from the root) in T of the node 
being bypassed will become larger, so eventually the ExploreSmall() action will 
be chosen. 

3.2 Cooperative Exploration of a Tree 

Procedure CoExplore() is the principal exploration procedure. The algorithm 
starts with both agents executing CoExplore(a;) from the home base x. It is 
a cooperative depth-first traversal in which each agent avoids the part being 
explored by the other (for as long as possible). 

When executing CoExplore(Node u), the agent, say a, determines if there is 
an unexplored subtree T„ of u. If so, it (recursively) explores Ty using cautious 
walk. That is, it will: mark the port from m to u as dangerous] go to v; (if it 
survives) mark the port from u to m as safe, return to u, mark the port from u 
to V as safe, and check for messages - if any, execute the corresponding code, 
otherwise return to v, recursively execute CoExplore(u), and finally mark the 
port from u to u as explored. This process is repeated as long as there is an 
unexplored subtree of u. 

There are now two cases. If all incident edges are explored, the tree T„ does 
not contain the black hole; in this case, the recursive call ends. Otherwise, there 
is only one non-explored (i.e., safe or dangerous) link left, the one leading to the 
subtree where the other agent is still working. Let {u, z) be such a link. If {u, z) is 
safe, a goes to 2 to join the other agent in exploring T^, executing CoExplore(z). 
If (m, z) is dangerous, the co-exploration cannot continue, and the agent executes 
MainDispatch(u, z, nil) to decide which action to take. 

The cost of CoExplore() is linear in the number of nodes explored, as it is 
essentially the traversal of a tree. 

3.3 Exploring a Small Forest 

When an agent finds the other blocked at a node, the agent executes Procedure 
ExploreSmall() if there are at most d unexplored nodes left; i.e. \U\ < d. The main 
idea is that the unexplored part is so small that it can be efficiently handled by 
executing the existing 0(n log n) algorithm of [1 1] on [/. However, that algorithm 
requires that the graph is biconnected, while U might not be. The main task of 
ExploreSmall() is therefore to explore U in such a way that, eventually, the part 
U' of U still left unexplored satisfies the following property: 

Property Small: There is a path tt of length 0{d) in G\U' such that adding 
7T to U' results in a biconnected graph U” . 

The description of how to obtain U' from U using 0(n) moves is rather technical 
and can be found in the appendix. When only U' is left unexplored, the path tt 
is identified and the algorithm from [11] is applied to U” . 
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Since \U\ < d, U' CU and \tt\ € 0{d), also \U"\ G 0{d) and the complexity 
of applying this algorithm to U" is 0{d\ogd). In other words, ExploreSmall() 
requires at most 0{n + dlogd) moves in total by the two agents. 

3.4 Exploring a Forest of Small Components 

If agent a finds agent b blocked at a node v, a must be able to bypass v to 
continue the exploration. BypassNode() is the branch taken in MainDispatch() 
if the largest connected component of C/ \ {u} is not too big (i.e. there are at 
least d other nodes in U). 

As a result of BypassNode(), either v is identified as the black hole, or all 
unexplored nodes will be located in a single subtree hanging from v; in the 
latter case, either PropertyUVW holds or ExploreSmall() is called. We will now 
describe its modules in some details. 

Main Action Procednres BypassNode() and ReleasedB(). When agent 
a finds agent b blocked on the link (u, v), a must be able to bypass v to continue 
the exploration. BypassNode() is the branch taken in MainDispatch() if the 
largest connected component of f7 \ {u} is not too big (i.e. there are at least d 
other nodes in U); if/when b returns to u, it executes procedure ReleasedB(). 

The main idea comes from observing that 1) each unexplored component can 
be reached from m by a safe bypass path /3 (otherwise v would disconnect G) 
and 2) |/3| G 0{d) (from the structure of the explored and unexplored parts). 

The trees (i.e., the connected components) in the forest U \ {u} must be 
reached by a to be explored. To control the amount of moves to reach each of 
those trees, the exploration of the trees will be done in order and with a different 
technique, depending on a the size of the tree. More precisely: 

1. Agent a first explores the largest components (i.e., the ones containing a sub- 
tree of size at least d); the cost of reaching any such tree is 0{d), hence it can 
be “charged” to the cost of exploring that tree (increasing the multiplicative 
constant but not the order of magnitude of that cost). 

2. After all these components have been explored (and assuming agent b is still 
blocked) a proceeds to explore the smaller ones according to the following 
strategy: 

a traverses the whole explored part in one sweep and whenever it finds a 
link leading to an unexplored component C, a explores C, then returns and 
continues with the traversal. If b is still blocked, the only unexplored node 
left is V that is identified as the black hole. 

What makes the situation complex is the fact that b could become unblocked 
at any time; the algorithm must be able to handle correctly and efficiently every 
configuration this fact might cause. This goal is achieved by carefully explor- 
ing the components using a special technique (procedure ExploreComponent) 
described in the next section. 

The activity of agent 6, when (if) it becomes unblocked and finds the message 
<BypassNode>, is prescribed by procedure ReleasedB(): when (if) it is released. 
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b follows the trace of a, until it reaches the last safe node visited by a; b then 
leaves there a message, telling a to finish its component, and goes on exploring 
the remaining unexplored subtrees of v. If a finishes its part, we are exactly 
in the situation where the agents continue the co-exploration of T„ as if no 
bypassing had happened. However, if b finishes first while a is still working on its 
component, the structure of the remaining unexplored part can be quite complex. 
In this case, the unexplored part must be consolidated so that PropertyUVW 
holds and MainDispatch() can be applied. This is accomplished by procedure 
CollapsePath() described later. 

ExploreComponent. To simplify the consolidation (without increasing the 
complexity), the components are explored in a special manner: Consider the 
component graph H where nodes correspond to the subtrees of component C, 
and edges correspond to connections (in G) between subtrees. The component 
graph H is “visited” using a depth-first traversal, where “visiting a node” in H 
means fully exploring the corresponding subtree before going to the next one. 

Such an approach guarantees the following property to hold: 

Property PI: At any moment there is only one partially explored subtree Tyi 
of component C. 

This property will allow 6, when unblocked, to tell a to finish only the current 
subtree T„/, not the whole component C. 

Consider now the partially explored subtree T„/ . Let y be the node in which 
a entered T„/ and let TTyyi be the path from y to the root v' . After agent a 
enters T„/, it performs a DF-traversal of T„/ starting from y and satisfying the 
following restriction: If an edge (h,k) belongs to %„/, it will be the last edge 
explored from h. Such an approach guarantees that the unexplored part of Ty> 
has the following structure: 

Property P2: Let w be the last explored node on the path from y to the root 
v' , and let w' be the next node (if w yf v') on this path. The path from w' to 
v' , together with all subtrees hanging from this path, is unexplored. If a is on 
the link from w to w' , then all subtrees hanging from w are explored; otherwise, 
a subtree of w contains a and is partially explored, while the other subtrees are 
either fully explored, or unexplored. 

Notice that, if b has done all its work and a is still in T„/, property P2 tells us 
that the unexplored part consists just of a path and the trees hanging from this 
path. In order to reach a configuration where PropertyUVW is finally satisfied 
(and, thus MainDispatch() can be called again), this path is explored using the 
procedure CollapsePath() described in the next section. 

As each subtree is traversed at most twice, the complexity of ExploreCom- 
ponent () is linear in the number of explored nodes. 

CollapsePath. Procedure CollapsePath() is called when all subtrees of v except 
one have been explored. The remaining unexplored nodes form a path TTg with 
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unexplored subtrees hanging from this path. The agents divide the remaining 
unexplored path TTg into two disjoint parts with about equal number of nodes 
(the nodes of the hanging unexplored subtrees are counted as well), and each 
agent goes on exploring its part. Note that there is a safe bypass path f3 of 
length 0{d) connecting the opposite ends of TTg : if TTg is in the first subtree T' 
of component C, we use as bypass j3 the path that agent a had used to reach C; 
otherwise, j3 passes through the (already explored) subtree T" from which the 
entry point of T' was reached (note that T” is connected to the already explored 
node v). 

The agent that finishes first uses f3 to reach the other one and split the 
workload again. This is repeated until there is a single unexplored node in the 
path or the number of unexplored nodes becomes less then d. In the first case, 
PropertyUVW is satisfied, in the second case ExploreSmall() is directly executed. 

It may happen (because the agents are also exploring the subtrees hanging 
from the path) that the whole path is explored. In such a case, both agents end 
up co-exploring a single subtree hanging from this path and eventually one of 
them will become blocked, creating a configuration corresponding to the first 
case of PropertyUVW. 

Each round of halving the unexplored path involves traversing a bypass of 
length 0{d). However, as the number of unexplored nodes is about halved at 
each round, and CollapsePath() terminates when there are less then d nodes 
left, the number of rounds with less than d newly explored nodes is 0(1). In 
other words, the overhead of of CollapsePath() is only 0{d). 

3.5 Exploring a Forest with a Large Component 

When agent a must bypass agent b blocked at a node v, if U \ {u} has one 
overwhelmingly large subtree T^, then BypassNode() may be inefficient: It can 
happen that an overhead of 0{d) is incurred, but the one remaining partially 
explored subtree of v is itself but only o(d) new nodes have been explored. 
Since in the worst case, this can happen 0(d) times, the total overhead would 
be an unacceptable O(d^). 

To avoid such an overhead, we must make sure there is enough progress; in 
this way, while the overhead is not avoided, it has no chance to accumulate. This 
is done by procedure BypassHardNode(). 

Let X be the node dividing T\u = U n T (the tree induced in T by the nodes 
of U) into components of size at most |U|/2. BypassHardNode() uses the path 
TTy^x from u to a; as the pivot, i.e. after BypassHardNode() there is either a single 
unexplored subtree of T\u hanging from the fully explored or there is one 
unexplored node w G 7r„^a; and the subtrees hanging from it are unexplored. 

In this way, we are ensured that the number of unexplored nodes is at least 
halved and MainDispatch() can be applied again. Moreover, the number of newly 
explored nodes is at least d/2, i.e. there are enough of them to distribute the 
0{d) overhead among them. 

There is a major problem with this approach: while all components ofU\{v} 
are reachable from the already explored nodes, the same is not true for the 
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components of U \ TTy^x- We must make sure that at least some components (or 
the path itself) are reachable. 

To achieve that, the idea of BypassHardNode() is to proceed along the path 
from V to X, taking as long a jump ahead as possible: in other words, a goes 
to the furthest reachable subtree hanging from 7 t„ a, (w is the parent if w') 
and explores from there (first Tyy/, then the path from w to x and the hanging 
subtrees), while b (after/if becoming unblocked) will take care of the path (and 
hanging subtrees) between v and w. If a finishes first, the remaining unexplored 
path between v and w will be collapsed and the number of unexplored nodes is at 
least halved. If b finishes first, there are several possible cases but two principal 
outcomes: 

(1) The unexplored nodes are limited to the subtree Ty,'. The corresponding 
unexplored path is collapsed (procedure CollapsPath() is called). 

(2) We are essentially in the same situation as we started, but we have moved 
along the line from u to a; to at least w. The process (bypassing) is repeated until 
X is reached and/or the unexplored nodes are limited to a subtree (and possibly 
its root) hanging from the path (thus, Collapsepath() can be called to terminate). 

In case 2) a new bypass path will be needed to reach a smaller area of unex- 
plored nodes. The crucial observation is that the bypass does not go below w, 
otherwise would not have been the furthest reachable subtree. This means 
that the bypass of iteration i > 1 uses only nodes explored in the previous 
iteration. 

As each bypass path (except in the last iteration, when it is used in Col- 
lapsePath()) is used only a constant number of times, the overhead of iteration 
i > 1 can be charged to iteration i — 1. We are left with the overhead 0{d) 
for the first iteration and for the possible CollapsePath() in the last iteration. 
Since BypassHardNode() has explored at least d/2 nodes, this overhead can be 
distributed among them. 

4 Analysis 

Due to space constraints, we omit correctness and complexity analysis of Algo- 
rithm . Full proofs can be found in the technical report. 
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Abstract. This paper concerns construction of additive stretched span- 
ners with few edges for n-vertex graphs having a tree-decomposition 
into bags of diameter at most S, i.e., the tree- length <5 graphs. For such 
graphs we construct additive 25-spanners with 0(Sn log n) edges, and ad- 
ditive 45-spanners with 0{Sn) edges. This provides new upper bounds for 
chordal graphs for which 5 = 1. We also show a lower bound, and prove 
that there are graphs of tree-length 5 for which every multiplicative 5- 
spanner (and thus every additive (5— l)-spanner) requires 
edges. 

Keywords: additive spanner, tree-decomposition, tree-length, chordal- 
ity 



1 Introduction 

Let G be a connected graph with n vertices. A subgraph iL of G is an (s,r)- 
spanner if dniu, v) ^ s • dc(u, v) + r for all pair of vertices u, v of G. An (s, 0)- 
spanner is also termed multiplicative s-spanner, and an (1, r)-spanner is termed 
additive r-spanner. An (s, r)-spanner is also an (s -I- r, 0)-spanner (in particular, 
an additive r-spanner is a multiplicative (r-|- l)-spanner), but the reverse is false 
in general. 

The main objective is to construct for a graph an (s, r)-spanner with few 
edges. There are many applications of spanners, for example, the complexity 
of a lot of distributed algorithms depends on the number of messages, itself 
depending on the number of edges [23,21]. Sparse spanners occur also in the 
efficiency, of compact routing schemes [24]. Unfortunately, given an arbitrary 
graph G and three integers s, r and m, determine whether G admits an (s, r)- 
spanner with m or fewer edges, is NP-complete [22], even if we restrict r = 0 (see 
also [6,5,7,11,18] for complexity issue). Best known results on (s, r)-spanners 
for general graphs are summarized in Table 1. 

An interesting question still left open is to know whether every graph has an 
additive (2/c — 2)-spanner with edges, for /c > 2. In the affirmative, this 

would generalize the result of [12, 16] {k = 2) and implies also the observation 

* Supported by the European Research Training Network COMBSTRU-HPRN-CT- 
2002-00278. 
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Table 1. 



(s, r)-spanner 


edges 


reference 


(1,2) 




[12, 16]“ 


{2k- 1,0) 


0{rd+^^'^) 


[2,27,3]‘ 


{k- l,2k-A) 


0{kn^+^l^) 


[16], fc ^ 4 even 


(fc - 2 -b e, 2fc - 2 - e) 


0{e-^krd+^/'“) 


[16], fc ^ 3 odd, e > 0 


(fc — 1 -b e, 2fc — 4 — e) 


0(e-ifcni+iA) 


[16], fc ^ 4 even, e > 0 


(1 -b e, /3(c, k)) 


0(/3(e, k)rd+^/'^) 


[16]“, fc is 2 



“ The bound of [12] was n) edges but with a better running time, 

0{n^^'^) v.s. 0(v?). 

^ This observation due to [2] is based on the classical result (see [1]) that every graph 
with at least edges has a cycle of length at most 2fc, for every k ^ 1. 

[27] and [3] gave respectively an 0(kmn^^'°) and 0{km) time algorithm for the 
construction of such spanner. 

Actually f3{e, k) = /jmax{iogiogfc-ioge,iog6} for 2 ^ ^ logn and fixed 0 < e < 1. 



of [2]. Another interesting question is to known whether the 0(n^+^/^) edge 
bound for multiplicative {2k — l)-spanner is tight or not. This bound directly 
relies to an 1963 Erdos Conjecture [17] on the existence of graphs with 
edges and girth at least 2k + 2. This has been proved only for k = 1,2,3 and 
k = 5. 

Better bounds can be achieved if we restrict spanners to be trees, or if par- 
ticular classes of graphs are considered: planar graphs [18], and more structured 
graphs (e.g., see [20] and [26] for a survey). Among them, the class of chordal 
graphs is of particular interests [6,22,9]. A graph is k-chordal if its induced cy- 
cles are of length at most k. Chordal graphs coincide with 3-chordal. Here below 
are summarized the best constructions for fc-chordal graphs. 



chordal 


{s, r)-spanner 


edges 


reference 


3 


(2,0) 




[22] 


3 


(3,0) 


0{n log n) 


[22] 


3 


(1,3) 


0(n log n) 


[9] 


k 


(1, fc -b 1) 


e{n) 


[9], A: is 3 



Tree-decomposition is a rich concept introduced by Robertson and Sey- 
mour [25] and is widely used to solve various graph problems. In particular 
efficient algorithms exist for graphs having a tree-decomposition into subgraphs 
(or bags) of bounded size, i.e., for bounded tree-width graphs. 

The tree-length of a graph G is the smallest integer S for which G admits a 
tree-decomposition into bags of diameter at most i5. It has been formally intro- 
duced in [15], and extensively studied in [13]. Chordal graphs are exactly the 
graphs of tree-length 1, since a graph is chordal if and only if it has a tree- 
decomposition in cliques (cf. [10]). AT-free graphs, permutation graphs, and 
distance-hereditary graphs are of tree-length 2. More generally, [19] showed that 
fc-chordal graphs have tree-length at most k/2. However, there are graphs with 
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bounded tree-length and unbounded chordality^, like the wheel. So, bounded 
tree-length graphs is a larger class than bounded chordality graphs. 

For several problems involving distance computation, like the design of ap- 
proximate distance labeling schemes [19] or of near-optimal routing schemes [13], 
tree-length S graphs are a natural generalization of chordal graphs, and their 
tree-decomposition induced can be successfully used. In this paper we highlight 
a new property of bounded tree-length graphs: the design of sparse additive 
spanners. The following table summarizes the bounds we have obtained on the 
minimum number of edges of additive spanner. 



tree-length 


(s, r)-spanner 


edges 


5 


(1,25) 


0(5n log n)“ 


5 


(1,45) 


0(Sn) 


5 


(5,0) 


I7(n^+'), 6^1/ [5/2] 






for** 5 = 1,2, 3, 4, 5, 6, 9, 10 



“ In a forthcoming paper [14], we show that the number of edges can be reduced to 
0(Sn + nlog n). 

^ The last line relies to the Erdos’s Conjecture, but in any case e ^ 1/6>(S) (cf. Table 
of Corollary 1). 

Thus our first result provides an additive 2-spanner with O(nlogn) edges for 
chordal graphs (for 5=1), improving [9] and also implying [22]. 

In this paper, we also compare our algorithm to the Chepoi-Dragan-Yan’s 
algorithm (CDY) used successfully for fc-chordal graphs [9]. For small chordal- 
ity k, our algorithm produces an additive 2/c-spanner (or (2k — 2)-spanner of 
odd k) with 0(n) edges (recall that S ^ k/2), whereas CDY’s algorithm con- 
structs an additive (k + l)-spanner with 0(n) edges, which is better for k ^ 4. 
However, we show in Section 3 that the CDY’s algorithm cannot be used for 
bounded tree-length graphs of large chordality. More precisely, we construct a 
worst-case graph of tree-length 3 and chordality f2(n^^^) for which the CDY’s 
algorithm produces an l7(n^/^)-spanner with 0(n) edges. A generic algorithm 
would certainly combine both algorithms. 

The lower bound shows that every additive o(5)-spanner requires f2(n}'^'^) 
edges. However, combined with our two upper bounds, this naturally leads to 
the question of whether there exists, for every tree-length 6 graph, an additive 
0(5)-spanner with 0(n log n) or even with 0(n) edges. 

We now recall the definition of tree- decomposition introduced by Robertson 
and Seymour in their work on graph minors [25]. A tree-decomposition of a 
graph G is a tree T whose vertices, called hags, are subsets of V (G) such that: 

1- UxeF(T) ^ ~ 

2. for all {u,v} G E(G), there exists X G V(T) such that u,v € X; and 

3. for all X,Y, Z G V (T), if Y is on the path from X to Z inT then XnZ CY. 

The length of tree-decomposition T of a graph G is TLna,xx^v(T) 
da(u,v), and the tree-length of G is the minimum, over all tree-decompositions 
T of G, of the length of T. 

The chordality is the smallest k such that the graph is fc-chordal. 



1 
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A well-known invariant related to tree-decompositions of a graph G is the 
tree-width, defined as minimum of maxxg\/(T) over all tree-decompositions 

T of G. We stress that the tree- width of a graph is not related to its tree-length. 
For instance cliques have unbounded tree- width and tree-length 1, whereas cycles 
have tree-width 2 and unbounded tree-length. 

The paper is organized as follows. In Section 2, we present the first algo- 
rithm (Line 1 of the previous table). Section 3 presents the second algorithms 
(Line 2) and the CDY’s algorithm, both based on a layering-tree of the graph. 
We conclude in Section 4 with the lower bound. 

2 Additive 25-Spanner with 0(5n log n) Edges 

Theorem 1. Every n-vertex graph of tree-length 6 has an additive 26-spanner 
with 0{Snlogn) edges. 

The remaining of this section concerns the proof of Theorem 1. For this 
purpose we need several ingredients, and of two basic properties. 

It is well known that every tree T has a vertex u, called median, such that 
each connected component of T\{u} has at most \\V{T) \ vertices. A hierarchical 
tree of T is then a rooted tree H defined as follows: the root of H is the median 
of T, u, and its children are the roots of the hierarchical trees of the connected 
components of T \ {m}. Observe that T and H share the same vertex set, and 
that the depth of H is at most^ log |E(T)|. 

A tree-decomposition is reduced if any bag is contained in no other bags. A 
leaf of such decomposition contains necessarily a vertex contained in none other 
bags. Thus by induction the number of bags of a reduced tree-decomposition 
does not exceed maxjn — 1, 1} for an n-vertex connected graph (cf. [4]). 

From now, G is a (connected) graph with n vertices and of tree-length 6. T 
denotes a tree-decomposition of G of length 6 supposed reduced, and H denotes 
a hierarchical tree of T. So, the depth of H is at most logn. We denote by 
NCAh{U,V) the nearest common ancestor of U and V in H. 

Property 1. Let U, V be two vertices of T and let Q be the path in T from U to 
V, and let Z = NCAff(U, V). Then, Z G Q, and Z is an ancestor in H of all the 
vertices of Q. 

Proof. By construction, the subtree induced by Z and its descendants in PI is 
a connected component of T, say A. Thus, Z, U, V are in A, but U and V are 
in two different components of T \ {Zj. Thus in T, the path Q from U to V 
is wholly contained in A and intersects Z. So, Z G Q and Z is ancestor of all 
vertices of Q in iL. □ 

We assume that T is rooted. For every vertex u of G, the bag of u, denoted 
by 'B(u), is a bag A of T of minimum depth such u G X. Observe that, once T 
has been fixed, 'B(u) is unique for each u. 

All the logs are in base two. 



2 
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Property 2. For every edge {u, f } of G, either 

1. ®(u) is an ancestor of 'B{v) in T and u G ®(f), or 

2. 'B(w) is an ancestor of ®(u) in T and v G 3(u). 



Proof. Let X be any bag of T with a vertex x. Since x G X LI 'B(x), by Rule 3 of 
tree-decomposition’s definition and by minimality of the depth of ‘B(x), we have 
that AT is a descendant of ‘B(x). Now, by Rule 2 there is a bag, say Y, containing 
u and V. It follows that F is a descendant of 'B(m) and of i.e., either 'B(m) 
is an ancestor of ‘B(n) or the reverse. If 'B(u) is an ancestor of ‘B(n), then 'B(f) 
is on the path from 'B(u) to V in T. By Rule 3, u G 'B(v). Similarly if 'B(v) is an 
ancestor of 'B(m), then v G ‘B(u). □ 

We associate with every bag X of T a shortest path spanning tree of G rooted 
at an arbitrary vertex rx G X . A vertex m of G is good w.r.t. Sx if ®(u) is a 
descendant of X in P[. Otherwise u is bad w.r.t. Sx- Clearly a vertex u can be 
good for some trees and bad for others, but since H is of depth at most logn, u 
is good for at most logn trees. 

For every tree Sx, we define Sx, the tree obtained from Sx by removing 
recursively each bad leaf w.r.t. Sx- The spanner of G claimed by Theorem 1 is 
simply the graph defined by G' := UxeV'(T) ^x- 

Lemma 1. G' is an additive 26-spanner ofG. 

Proof. Let u,v be two vertices of G, and let Z = NCA//('B(u), 'B(w)). By defi- 
nition, u,v are good in Sz, thus u,v are both in S'^. By Property 1, Z belongs 
to the path from 'B(m) to 'B{v) in T, thus there is a vertex z G Z such that 
dciu, v) = dc(u, z) + dc{z, v). Since S'^ is a shortest path rooted at rz, we have 
dG'{u,v) ^ ds'^{u,v) ^ dG{u,rz) + dG{rz,v)- Note that dG{z,rz) ^ 6, and by 
the triangle inequality, we have: 

dG'{u,v) < ds’^{u,v) ^ dG{u,z) -G dG{z,rz) -b dG{rz,z) -G dG{z,v) 

< dG{u, z) + dG{z, v) + 2dG{z, rz) 

< dG{u,v) +25 . □ 

The difficult part is to upper bound the number of edges. 

Lemma 2. G' has 0{6nlogn) edges. 

Proof. For every vertex n of G and every bag X of T, we define cost(n, AT) = 1 
if M G S'x, and cost (n, A) = 0 if u ^ S'^. Clearly, the number of vertices of S'^ 
is X)m6V(g) cost(n, A), and so the total number of edges of G' is 



\E{G')\ < E 



cost(u,AT) 



< E E cost(n, A) 



XeT \\uGV{G) ) ) Xf^TudViG) 

The problem to estimate this upper bound is that neither cost(u, AT), nor 
'Ylix cost(w, X) are easy to calculate. To overcome this problem, we assume that 
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each vertex u has a “charge” for each bag X, denoted by charge(w, X), that 
can be exchanged with its neighbors under the following condition. Initially, 
charge(rt, X) = cost(rt, X), for all u and X. Then, while there is a vertex u and 
a bag X such that: 

1. M is bad in S'x, and 

2. charge(u, X) > 0, 

charge(w, X) is decremented and charge(u, X) incremented, where v is one of the 
children of u in S'x (observe that there is no leaf u in S'x that is bad in S'x)- 
Such a procedure converges, and we denote by charge* (m, X) the final charge of 
u w.r.t. X. 

Clearly, '^x the other hand we 

have: 

EE charge* (m, X) = EE charge* (m, X) < n • max < ^ charge* (m, X) > 
Xu u X “IjC J 

From the above procedure, if u is bad in S'x then charge* (u,X) = 0. Say in 
other words, if u is good in t trees, then the sum ^x charge* (u, X) has at most 
t non- null terms. We have seen that a vertex u is good in at most logn trees. 
Thus, charge* ( m, X) ^ maxx {charge* (u, X)} • logn. To summarize, 

\E{G')\ < charge*(u,X) < n log n • max (charge* (u,X)} 

u.X 

X u 

It remains to show that charge* (u,X) = 0{5), for all u and X. We assume 
that u is good in S'x (otherwise we have seen that charge* (n,X) = 0). Observe 
that, in S'x, u can receive the charge of v only if: 1) v is bad, 2) v is an ancestor 
of u, and 3) there is no good vertex on the path from u to u in S'x- So, let v be 
the nearest ancestor of u that is good in S'x {v is set to the root if such vertex 
does not exist). The charge received by u (w.r.t. X) is thus at most ds'^{u,v), 
and thus its total charge is charge* (n, X) < 1 -I- {u, v). 

Let P = xq,xi, .. - ,Xp be the path in S'x from u = xq to v = Xp. P is a 
shortest path in G and p = da{u,v) ^ 1. Our aim is to show that p < 3<5. Let Q 
be the path in T from ‘B(n) to 'B(u). 

Claim. For every 0 < i < p, 'B(xi) ^ Q. 

Let Z = XCAh{‘B{u),‘B{v)). By construction of P, every Xi G P \ {xo,Xp} is 
bad, i.e., 'B(xi) is not a descendant of Z. By Property 1, all the bags of Q are 
descendant of Z, so 'B(xi) ^ Q, for every 0 < z < p, as claimed. 

Let j be the largest index such that Xj € ®(zz). Obviously, j ^ 6 because 
Xo,Xj belongs to the same bag, 'B(xn). W.l.o.g. p > j, since otherwise p < <5 and 
we are done. Let Y = NCAt(‘B(m), 'B( u)). 

Claim. Xj G Y. 

If 'B(u) = Y, then we are done. Assume, ‘B(zt) yf Y. If j = 0, then we get 
a contradiction applying Property 2 on the edge (zt, a;i}. Indeed, either 'B(xi) 
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is an ancestor in T of 'B(m) and x\ G 'B(m), contradicting j = 0, or 'B(u) is 
an ancestor of In this latter case, since the path P intersects Y and 

'B(u) ^ Y, the sub-path xi, . . . ,Xp must intersect 'B(u) in some Xi with z ^ 1, 
contradicting again j = 0. So we have 0 < j < p, and thus ‘B(xj) ^ Q. Now, 
'B(xj) must be an ancestor in T of 'B(u), because Xj G 'B(m). Since 'B(xj) ^ Q, 
it follows that ‘B(xj) is an ancestor in T of all the ancestors of 'B(m) in Q. In 
particular Xj belongs to all the ancestors of 'B(m) in Q, so Xj G F as claimed. 

Let k be the largest index such that Xk G Y. Since da{xj,Xk) ^ S, we have 
k ^ 26. W.l.o.g. k < p, since otherwise p ^ 26 and we are done. Since p > k, the 
vertex Xk+i exists. Let Y' = NCAT(®(a;fe+i), ®(a;p)). 

Claim. Y' G Q, and x\^ G Y' . 

First observe that for every t > k, ‘B{xt) is a descendant of Y. Otherwise, since 
the sub-path xt, ... ,Xp must intersect y, Y would contain some Xi with i > k, 
contradicting the definition of k. In particular, ‘B{xk+i) is a descendant of Y. 
Since 'R{xk+i) is descendant of Y' , it follows that Y' is a descendant of Y, and 
thus Y' is on the path in T from Y to ‘B(xp). I.e., Y' G Q as claimed. Let us 
apply Property 2 on the edge {xk,Xk+i}. ‘B{xk) is ancestor of y {xk G Y), and 
thus ‘B{xk) is ancestor of 'R{xk+i), and thus Xk G 'R{xk+i). Thus Xk belongs 
to all the bags of the path in T from Y to 'B{xk+i). In particular, Xk G Y' 
completing the proof of the claim. 

The sub-path Xk+i, . . . , Xp must intersect Y' , say in some vertex Xf. Since 
£ > k, ‘B{xi) is a descendant of Y. It is also an ancestor of Y' . Thus, ‘B{xi) 
belongs to the path in T from Y to Y' , i.e., ‘B{xi) G Q. Since ‘B(xi) ^ Q for 
z < p, it turns out that £ = p. We conclude with the fact that da{xk,xi) < <5, 
thus p = £^k + 6^‘i6. 

Therefore, \E{G')\ < (3(5 -I- l)nlogn, completing the proof of Lemma 2. □ 

Theorem 1 directly follows from lemmas 1 and 2. 

3 Additive Spanner with a Linear Number of Edges 

We present in this section an algorithm to construct for any tree-length 6 graph 
an additive 45-spanner with 0{6n) edges. Since for every graph the tree-length 
is at most half the chordality, it follows that this spanner is as well an additive 
2/c-spanner (or {2k — 2)-spanner of odd k) with 0{kn) edges if the graph is k- 
chordal. This latter construction is far from the optimal. Indeed, Chepoi et al [9] 
have presented an algorithm which computes, for any graph G of chordality k, 
an additive {k + l)-spanner with 0{n) edges. 

The CDY’s algorithm, and our algorithm as well, is based on a Layering-tree, 
technique we present in Section 3.1 with the CDY’s algorithm. Our variant is de- 
tailed in Section 3.2. In Section 3.3, we compare both algorithms and show that, 
there are tree-length 3 graph for which the CDY’s algorithm returns an additive 
l7(n^/^)-spanner whereas our algorithm guarantees an additive 12-spanner^ with 
0{n) edges. 

® Actually, on the counter-example, it produces an additive 3-spanner. 
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3.1 Layering- Tree and the CDY’s Algorithm 

Let G be a graph with a distinguished vertex s. For every integer i ^ 0, we 
define L* := {u G V{G) \ da{s,u) = i}. A layering partition of G is a partition 
of each set L* into L\, . . . , L^. such that u,v G L* if and only if there exists a 
path from u to v using only intermediate vertices w such that dG{s,w) ^ i. 

Let H be the graph whose vertex set is the collection of all the parts L*. In 
H, two vertices L* and Lt, are adjacent if and only if there exists u G L® and 
V G L®/ such that u and v are adjacent in G (see Fig. 1 for an example). The 
vertex s is called the source of H. 

Lemma 3. [8] The graph H , called layering-tree of G, is a tree and is com- 
putable in linear time. 

In each part W oi H, a, special vertex rw is distinguished®^ (grayed vertices 
on Fig. 1(a)). 






Fig. 1. (a) A 5-chordal graph, (b) a layering-tree of it, and (c) the CDY’s spanner. 

The CDY’s spanner of G is composed of the following edges (see Fig. 1(c)): 

1. all the edges of a shortest path spanning tree S' of G rooted at s, and 

2. for every vertex u of a part IF of il, the edge {x,y} (if it exists) such that 
X is the nearest ancestor of tt in S with a neighbor y ancestor of r\\r. 



3.2 Additive 0(^)-Spanner with 0{6n) Edges 

Theorem 2. Every n-vertex graph of tree-length 6 has an additive 46-spanner 
with at most (26 -I- l)(n — 1) edges. 

The construction is also based on the layering-tree H of G, presented in 
Section 3.1, and on the following result: 

In the original CDY’s algorithm rw is given by a fixed policy we do not detail 
here. Here, we consider that any vertex of W can be chosen, it is simpler but more 
powerful. 
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Lemma 4. Let H be a layering-tree of G. For every part W of H , there is a 
vertex r of G, called center ofW, such that for all u,v G W, dG{u,v) ^ 3i5 and 
dciu^r) ^ 26. Moreover, for every 5, these hounds are best possible. 

Proof. Let T be a tree-decomposition of G of length 6, W.l.o.g. T is supposed 
rooted at a bag containing s, the source of H. Let W he a part of H at distance 
j of s. Let X be the bag of T that is the nearest common ancestor of all the of 
bags containing a vertex of W, and let dx = max^^^gx dc{u, v) be its diameter. 
Let us prove that for every u G W, do(u,X) ^ 6. In this way, we will prove 
that: 

— Vu, v G W, dciu, v) < dciu, X) dx + da{v, X) ^ 3i5; 

— 'iu gW and Vr G X, dc{u, r) < dciu, X) -h dx ^ 26. 

Let u be an arbitrary vertex of W. Consider a vertex v G W such that there 
are two bags, U and V, such that: u G U , v G V , and X = NCAt{U,V) (we 
check that v, U, V exist). Let P be a shortest paths from s to u, P intersects X. 
Let X be the closest from s vertex in PC X. Since u, v are both in W , there exists 
a path Q from u to v using only intermediate vertices w such that dG{s,w) ^ i. 
Q intersects X at a vertex r (see Fig. 2(a)). 



Fig. 2. A part of H is of diameter at most 35 and of radius at most 25. 

Note that dG(s,u) = i = dcisjx) da{x,u) and da{s,r) Gi dG{s,x) 6. 
So, dG{s,r) ^ i 6 — dG{x,u). If dG{x,u) ^5-1-1 then dG{s,r) ^ i — I: a 
contradiction since r G Q. So, dG{u,X) ^ dG{u,x) ^ 5 as claimed. 

These bounds are best possible for each 5^1. For 5 = 1, the graph depicted 
on Fig. 2(b) is chordal, u, v, w belong to the same part and dG{u, v) = dG{u, w) = 
dG{v,w) = 3. By replacing each edge by a path of length 5, the tree-length of 
this subdivision increases to 5, u, v, w still belong to the same part and are at 
distance 35. We check that a center c for W can be chosen arbitrarily among 



U 




(a) 



(b) 
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{x, y, z, r, s} and attains a radius 2S. Moreover, if c ^ {x, y, z, r, s}, one can prove 
that either da{u,c) > 26 or da{v,c) > 26 or da{w,c) > 26. Thus the radius of 
the part containing u, v, w is exactly 26. □ 

The spanner satisfying Theorem 2 is simply the graph defined by G' := 
'S'UUwev(ff) where S' is a shortest path tree spanning G, H a, layering-tree 
of G, and Sw a shortest path tree spanning W and rooted at a center of W. 

Lemma 5. G' is an additive 46-spanner of G. 

Proof. Let u, v be two vertices of G, and let A, B be the two parts of H containing 
respectively u and v. Let us show that every path from m to f must intersect the 
part W = NCAb(A,B). 

This clearly holds if W = A or W = B. If W A and W ^ B, (so in 
particular A yf B), then, by definition of H , every path intermediate vertex of a 
path from m to w must intersect an ancestor of A and of B. So, by induction, it 
must intersect the nearest common ancestor of A and of B, W . 

Let u', v' G W he the ancestors in S' of m and v. We observe that dciu, u') = 
dniA.W). Since G' contains a shortest path spanning tree of G rooted at s, it 
follows that dH{A,W) = do'{u,u'), and finally, da(u,u') = do'{u,u'). Similarly, 
dciv, v') = do'{v, v'), and thus do'{u, u') dc'iv, v') ^ da{u, v). 

Using the tree Sw contained in G' and rooted at the center of W, and by 
Lemma 4, we have do'{u',v') ^ 46. Therefore, we obtain: 

da'{u, v) < da'{u, u') + dciu', v') + dcW , v) < daiu, w) -I- 4i5 . 

□ 



Lemma 6. G' has at most {26 + l)(n — 1) edges. 

Proof. S has n — 1 edges. Every tree Sw has at most |1U| leaves and so at most 
2i5|lU| edges, except when W is the root of iL. In this latter case, Sw contains 
no edges. The parts of PI are disjoint, so the number of edges of G' is at most 
n-l + 26{n-l) = {26+l){n-l). □ 

3.3 Counter-Example 

Theorem 3. There is a graph of tree-length 3 and with n + o{n) vertices for 
which the CDY’s algorithm, for any choice of the special vertices, constructs an 
additive Q{n^^^)- spanner with 0{n) edges. 

The remaining of this section is devoted to the proof of Theorem 3. 

The Cartesian product of two graphs A and B is the graph denoted by 
Ax B such that V{Ax B) = {{x,y) \ x G V{A),y G V{B)}, and E{A x B) = 
{{{x,x'),{y,y')) I (a; = a;' and {y,y') G E{B)) or {y = y' and {x,x') G E{A))}. 
E.g., the mesh is the Cartesian product of two paths. Let Kt and Pt denote 
respectively the complete graph and the path with t vertices. 

We set Dp = Kp X Kp. The counter-example, denoted by Go, is the graph 
Dt X Pt-i, so composed of t — 1 copies D],. . . ,D\ ^ of Dt, with an extra vertex 
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Fig. 3. Counter-example Go- 



s connected to all the vertices of Dj (see Fig. 3). To every vertex rt of Go, s, 
we denote by P{u) the copy of the path Pt-i containing u. Hereafter, we set 
t := , so that Gq has t^(t — 1) -I- 1 = n -I- vertices. 

A subgraph H of a, graph G is isometric if dnix, y) = dc{x, y), for all x,y £ 
V (H). It is a natural generalization of induced subgraph (any isometric subgraph 
is clearly an induced subgraph). We have: 

Lemma 7. [15] The tree-length of any isometric subgraph of G is no more than 
the tree-length of G. 

Let u, u, w be three vertices of some D\ inducing a path of length two (since 
Dt is of diameter two, such vertices exists). We check that the graph induced by 
the vertices of the three paths P{u), P{v), P{w) is a isometric mesh of Gq. This 
mesh has t — 1 rows and 3 columns. 

Lemma 8. Gq has chordality at least 2t = L?(n^/^), and tree-length 3 for t ^ 5. 

Proof. In a (t — 1) X 3 mesh, the perimeter is an induced cycle of the mesh of 
length 2t. Since this mesh is an isometric subgraph of Gq, it follows that Go is 
of chordality at least 2t. 

It is proved in [15] that the tree-length of the mesh with p rows and q columns 
is min {p, q} Up ^ q or pis even, and is p—1 otherwise. In particular, the ft—l) x3 
mesh has tree- length 3 if t ^ 5. By Lemma 7, Go has tree-length at least 3 for 
t ^ 5. 

We obtain a tree-decomposition of Go of length 3 by considering a path 
Xo,Xi, . . . Xt -2 where Xq = {s}uy(D(i), and W = V{d()UV{DI+^) for i ^ 1. 

□ 

A dominating set of a graph G is a set of vertices R such that for every vertex 
M of G either u £ R or u is adjacent to a vertex of R. 

Lemma 9. If R is a dominating set of Dt, then |i?| ^ t. 
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Proof. The graph Dt is the union of two disjoint sets of cliques 3Ci and IK 2 , each 
one composed of t disjoint copies of Kt, so that every edge belongs either to a 
clique of %i or of % 2 . Every clique of Xi intersects each clique of X 2 and vice 
et versa. Assume |i?| < t. By the Pigeon Hole Principle, there is a clique A G Xi 
with no vertices of R. Similarly, there is a clique B G X 2 with no vertices of 
R. The cliques A and B share exactly one vertex, say u (otherwise there would 
exists an edge that belongs to a clique of and to a clique of X 2 ). All the 
incident edges of u belongs either to A or to H. It follows that u is not adjacent 
to any vertex of R: a contradiction. □ 

Proof (of Theorem 3). Let G' be the spanner obtained by CDY’s algorithm 
applied on the source s of Gq. The parts of H are the set LP = {s}, and U = 
V{D\) for t ^ 1. The spanning tree S rooted at s used in G' contains exactly 
the edges incident to s and the edges of the paths Pt-i- No edge of any is 
contained in S. 

We assume that a special vertex ri has been arbitrarily selected for each part 
L* of iL, and let i? = {ri, . . . , rt-\}. 

Let M G L* be a vertex of Go, t yf 0. Observe that if u and ri are not adjacent 
in Go, then there is no edge in Gq (and thus in G') between P{u) and Pfri). 
Let R' be the projection of R on D\~^\ R' = {m G \ V{P{u)) n i? yf 0 }. 
|i?'| = |i?| = t — 1, so R' is not a dominating set of D\~^ (Lemma 9). 

Let u be a vertex of D\~^ with no neighbors in i?', and let r' G R' . From the 
above observation, in G' , there is no edge between P{v) and P{r'). During the 
second phase of the CDY’s algorithm, only edges incident to are added, for all 
t ^ 1. It follows that every vertex of P{v) has no incident edges in G', excepted 
those of P{v). So, dc{v,r') ^ 2(t — l) + 2 = whereas dog{v,r') = 2. G' 

is an additive f?(n^/^)-spanner, as claimed. □ 

4 Lower Bound 

Let m(n, g) be the maximum number of edges contained in a graph with n 
vertices and of girth at least g. It is clear that there exist at least an n-vertex 
graph for which every additive (g — 3)-spanner (or multiplicative (g — 2)-spanner) 
needs m{n, g) edges. Indeed, any graph G of girth g and of m(ji, g) edges has 
no proper additive {g — 3)-spanner: removing any edge {m, u} of G implies that 
dniu^v) ^ g — 1 = dc{u, v) + g — 2 and thus that H is not an additive {g — 3)- 
spanner of G. 

Theorem 4. For each S ^ 1, there exists a graph of n + 3S — 2 vertices and of 
tree-length S for which every multiplicative 6-spanner (and thus every additive 
((5 — 1) -spanner) needs m{n, (5 + 2) + 3i5 — 1 edges. 

Proof. Consider a graph G with n vertices, a girth at least (5 + 2, and with 
m{n, 5 + 2) edges. The diameter of G is at most 6. Indeed, otherwise G has two 
vertices, say u and v, at distance 5+1. So augmenting G by the edge u, v would 
provide a graph with n vertices, a girth at least 5 + 2, and with m{n, 5 + 2) + 1 
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edges: a contradiction with the definition of m{n,6 -I- 2). So G is of tree-length 
at most its diameter, that is < 6. 

Now we construct a graph G* obtained from G by selecting an edge of G, 
say {u, f }, and by adding a path of length 3S — 1, so that G* contains a cycle G 
of length 3(5. The graph G* has n + 36 — 2 vertices, a girth at least 5 + 2, and 
m{n, (5 -I- 2) -I- 3(5 — 1 edges. Again, G* does not contain any proper multiplicative 
(5-spanner. 

The tree-length of G* is exactly 5 observing that the tree-length of a graph 
composed of two subgraphs, say G and G, sharing a vertex or an edge is the 
maximum between the tree-length of G and the tree-length of G (because G 
and G are isometric subgraphs, and the common vertex or edge can be used to 
combined both optimal tree-decompositions). As shown in [15], the tree-length 
of a cycle of length k = 36 is \k/3^ = 6. □ 

An Erdos Conjecture [17] claims existence of n- vertex graphs with 
edges and of girth at least 2fc -I- 2. This has been proved only for k = 1,2,3 and 
A: = 5. It is known however that there are graphs of girth at least 2k + 2 with 
edges. From Theorem 4, we have: 

Corollary 1. For every constant 6^1, there are graphs with 0(n) vertices and 
tree-length 6 for which every multiplicative 6-spanner requires edges, 

where e ^ 1/ [(5/2] for (5^6. Moreover, for every 6, 1/0(6), where the best 

current lower hound on e is given by the table below. 

Proof. For each fixed integer k ^ 1, let f(k) be the largest real such that there 
exists an n- vertex graph of girth at least 2 A: -I- 2 and with edges. We 

have m(n, 2k + 2) = 

Consider the worst-case graph Gs given by Theorem 4. It has at most 5n/2 
vertices (recall that 6 < n/2 because the chordality of a graph is at most n — 1), 
and at least m(n, 5 + 2). Note that m(n, (5-1-2) < m(n,6+ 1). So, Gs has at least 
m(n, 2 [(5/2] -I- 2) = edges. 

It is known that f(k) = 1/k for all A: ^ 1, if the Erdos’s Conjecture holds. The 
following table summarizes the best known results on f(k). Complete references 
can be found in [27]. 



k = [5/2] 


f(k) 


1,2, 3, 5 


= 1/k 


4 


^ l/(k + 1) 


6,7 


^ l/(A:-t2) 


k = 2r, r ^ 4 


^ l/(3A:/2 - 1) 


A: = 2r — 1, r ^ 5 


^ l/(3A:/2 - 3/2) 



□ 
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Abstract. In this paper, we address a particular case of the general 
problem of A labellings, concerning frequency assignment for telecommu- 
nication networks. In this model, stations within a given radius r must 
use frequencies that differ at least by a value p, while stations that are 
within a larger radius r' > r must use frequencies that differ by at least 
another value q. The aim is to minimize the span of frequencies used in 
the network. This can be modelled by a graph labelling problem, called 
the L(p,q) labelling, where one wants to label vertices of the graph G 
modelling the network by integers in the range [0; M], while minimizing 
the value of M. M is then called the A number of G, and is denoted by 
A?(G). 

Another parameter that sometimes needs to be optimized is the fact 
that all the possible frequencies (i.e., all the possible values in the span) 
are used. In this paper, we focus on this problem. More precisely, we 
want that: (1) all the frequencies are used and (2) condition (1) being 
satisfied, the span must be minimum. We call this the no-hole L{p, q) 
labelling problem for G. Let [0; M'] be this new span and call the v 
number of G the value M' , and denote it by izJ(G). 

In this paper, we study a special case of no-hole L{p, q) labelling, namely 
where q = 0. We also focus on some specific topologies: cycles, hyper- 
cubes, 2-dimensional grids and 2-dimensional tori. For each of the men- 
tioned topologies cited above, we give bounds on the Vq number and show 
optimality in some cases. The paper is concluded by giving new results 
concerning the (general, i.e. not necessarily no- hole) L(p,q) labelling of 
hypercubes. 
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1 Introduction 

In this paper, we study the frequency assignment problem, that arises in wireless 
communication systems. We are interested here in minimizing the number of 
frequencies used in the framework where radio transmitters that are geographi- 
cally close may interfere if they are assigned close frequencies. This problem has 
originally been introduced in [12] and later developed in [9], where it has been 
shown to be equivalent to a graph labelling problem, in which the nodes repre- 
sent the transmitters, and any edge joins two transmitters that are sufficiently 
close to potentially interfere. The aim here is to label the nodes of the graph in 
such a way that: 

— any two neighbours (transmitters that are very close) are assigned labels 
(frequencies) that differ by a parameter at least p ; 

— any two vertices at distance 2 (transmitters that are close) are assigned labels 
(frequencies) that differ by a parameter at least q ; 

— the greatest value for the labels is minimized. 

It has been proved that under this model, we could assume the labels to 
be integers, starting at 0 [8[. In that case, the minimum range of frequencies 
that is necessary to assign to the vertices of a graph G is denoted \p(G), and 
the problem itself is usually called the L{p,q) labelling problem. The frequency 
assignment problem has been studied in many different specific topologies [8, 
14, 17, 1, 3, 5, 13, 2, 15, 16]. The case p = 2 and g = 1 is the most widely studied 
(see for instance ]6, 11, 10, 4]). Some variants of the model also exist, such as the 
following generalization where one gives k constraints on the k first distances 
(any two vertices at distance 1 < i < A: in G must be assigned labels differing 
by at least Si). One of the issues also considered in the frequency assignment 
problem is the no-hole labelling, where one wants to use all the frequencies in 
the span. More precisely, we want that: (1) all the frequencies are used and (2) 
condition (1) being satisfied, the span must be minimum. Let ]0; M'] be the span 
of frequencies that we obtain. We then call the v number of G the value M' , 
and we denote it by r'J(G). We note that depending on the values of p, q and on 
the considered graph G, a no-hole labelling might not exist. In that case, we let 
I'qiG) = oo. Hence, we clearly have r'J (G) > A^(G) for any p, q and G. 

In this paper, we study a special case of no-hole L{p, q) labelling, namely 
where q = 0. We also focus on some specific topologies: cycles, d-dimensional 
hypercubes, 2-dimensional grids and 2-dimensional tori. For each of the men- 
tioned topologies cited above, we give bounds on the j/q number for any value of 
p > 1 and d > 1, and show optimality in some cases. We conclude the paper with 
new results concerning the (general, i.e. not necessarily no-hole) L{p, q) labelling 
of the d-dimensional hypercube, Hd. 

2 No-Hole L(p,0) Labellings 

Proposition 1 (General graphs). For any p > 1 and any connected graph G, 
if a no-hole labelling of G exists, then: 
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— r'o(G) > 2p — 1 if G is bipartite. 

— Vq{G) > 2p if G is not bipartite. 

Proof. In any graph such that a no-hole L{p, 0) labelling exists, there must, by 
definition, exist at least one vertex of label 0. Let u be this vertex. Then all the 
neighbours of u must be labelled at least p. Since the labelling is no-hole, all the 
labels in the range [0;p] must be used. This is true, in particular, for label p — 1. 
Let V he & vertex whose label is p — 1 . Then v has all its neighbours labelled at 
least 2p — 1. 

Now, let G be non bipartite, and suppose that r'g(G') < 2p — 1. G has at least 
one odd cycle. Consider the vertices on this cycle, and let j > 0 be the minimum 
label among them, assigned to vertex v. li i > p, then the neighbours of v, not 
being of minimum label, must be assigned a label at least 2p, a contradiction. 
Hence, t G [0;p — 1]. In that case, the two neighbours of v on the cycle, say wi 
and W 2 , are assigned labels at least p + i, that is in the range [p; 2p — 1]. But 
the neighbours of and W 2 on the cycle must be assigned labels in the range 
[0;p — 1], etc. If we repeat this argument, we see that, when we will close the 
cycle, since it is odd, we will end up with a vertex z whose two neighbours, say 
X and y, are such that x is assigned a label in the range [0;p — 1], while y is 
assigned a label in the range [p; 2p — 1]. As Vq{G) < 2p — 1, there is no possibility 
to label z in an L{p, 0) fashion, a contradiction. 

Observation 1 For any graph G of order n that admits a no-hole labelling, 
n> i^oiG) -hi 

Proof. Suppose a no-hole labelling for G exists. In order to be able to assign the 
vertices of G all labels in the range [0; t'g(G)], we must have n > ^'□(G) -I- 1. 

Proposition 2 (Cycles). For any p > 1 and any graph G: 

— r'Q(G„) = 2p for any odd n > 2p -\- 1 

— r'Q(G„) = 2p — 1 for any even n>2p-\-2 

Proof. First, suppose that n is even. According to Proposition 1, we know that, if 
a no-hole L(p, 0) labelling for G„ exists, then j/q(G„) > 2p— 1. By Observation 1, 
we must have n > 2p. However, let v be the vertex which is assigned label p. 
In that case, both its neighbours must be assigned label 0, because only labels 
in the range [0; 2p — 1] are allowed, and the gap between two neighbours is at 
least p. Hence, n > 2p -I- 1; but since n is even, we have n > 2p -\- 2. Suppose 
the vertices of G„ are numbered clockwise from 1 to n. We give the following 
labelling function c on Cn : (a) for any vertex v numbered 2i -|- 1 (0 < j < p — 1), 
c{v) = p-\-i (b) for any vertex v numbered 2i (1 < i < p), c(v) = i — 1 ; (c) for 

any vertex v numbered 2i-\-l,i > p, c{v) = 2p— 1; (d) for any vertex v numbered 
2i (i > p -I- 1), c{v) = 0 (cf. for instance Figure 1 (right), where n = 2p -|- 2). 

Now, we show that this assignment is an L(p, 0) no-hole labelling of G„, for 
any even n > 2p -|- 2. First, consider any two neighbouring vertices j and j T 1, 
1 < j < 2p — 1. If j is even j = 2i, then c(j) = i — 1, while c(j -1-1) = p -\- i, 
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thus the gap of at least p is satisfied. If j is odd j = 2i + 1, then c(j) = p + i, 
while c{j -I- 1) = i, which also satisfies the L{p,0) condition. Now consider two 
neighbours j and j + 1, with 2p+l < j < n — 1. If j is even, then c(j) = 0 while 
c{j + 1) = 2p — 1, and if j is odd, then c(j) = 2p — 1 while c{j -I- 1) = 0. There 
still remain two cases to consider: (1) Vertices 1 and n and (2) vertices 2p and 
2p + 1. However, in case (1) we have c(l) = p and c(n) = 0 , while in case (2) 
we have c{2p) = p — 1 and c{2p -|- 1) = 2p — 1. 

Consequently, the L{p, 0) condition is satisfied. There is no hole as from the 
definition of the labelling, one can see that all labels are used on vertices 0 to 
2p: odd vertices 2f-|-l,0<i<p— 1 have labels from p to 2p — 1, while even 
vertices 2i, 1 < i < p are assigned labels from 0 to p — 1. 




Suppose that n is odd. By Proposition 1, we know that, if a no-hole L(p, 0) 
labelling for exists, then i/q(C'„) > 2p. Now, clearly, if n < 2p — 1, then there 
is not enough vertices to use all the labels. Thus, we must have n > 2p + 1 in 
order that the no-hole labelling exists. Assume that n > 2p-|- 1, and the vertices 
of Cn are numbered clockwise from 1 to n. We define the following labelling 
function c to Cn' (a) for any vertex v numbered 2z -|- 1 (0 < i < p), c(v) = p — i 
(b) for any vertex v numbered 2i (1 < i < p), c(u) = 2p — i + 1 ; (c) for any 
vertex v numbered 2i -|- 1, i > p -I- 1, c{v) = 0 ; (d) for any vertex v numbered 2i 
{i > p + 1), c{v) = p (cf. for instance Figure 1 (left), where n = 2p -|- 1). 

The labelling is an L(p, 0) no-hole labelling of C„, for any odd n > 2p-|- 1. 
First, consider any two neighbouring vertices j and j + 1, 1 < j < 2p. If j is even 
j = 2i, then c(j) =2p — i+l, while c{j -1-1) = p — i, thus the gap of at least p is 
satisfied. If j is odd j = 2i+ 1, then c(j) = p — i, while c(j -I- 1) = 2p — i, which 
also fulfills the L{p, 0) condition. Now consider two neighbours j and j + 1, with 
2p + 2 < j < u — 1. If j is even, then c(j) = p while c(j -I- 1) = 0, and if j is 
odd, then c(j) = 0 while c(j -I- 1) = p. Now there remains some cases to consider: 
(1) j = 2p + 1 and j -|- 1 = 2p -|- 2 and (2) j = n and j + 1 = 0 (that is, we 
“close” the cycle). But in both cases, we have c(j) = 0 and c(j -I- 1) = p. Thus, 
altogether, the L{p, 0) condition is satisfied. Now, by definition of the labelling, 
we can see that all the labels are used on vertices 0 to 2p: vertices of the form 
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2i + 1, 0 < i < p are assigned labels from 0 to p, while vertices of the form 2i, 
I < i < p are assigned labels from p + 1 to 2p. 

Proposition 3 (Hypercubes). For any d— dimensional hypercube Hd such 
that d > 2^, VoiHd) = 2p-l. 

Sketch of Proof. By Proposition 1, i'Q(Hd) > 2p — 1. We will first show that 
1^0 (Hd) < 2p — 1 (thus, proving the equality) for any d > 2p — 1 ; then, we will 
show that this result can be extended to any d > Suppose d > 2p — 1. The 
fact that v^^Pld) < 2p — 1 is proved by homomorphism into the following graph 
Gp'. (a) the nodes of Gp are the integers between 0 and 2p — 1 and (b) there is an 
edge between u and v in G'^ \E \u — v\ > p. Clearly, G^ represents the constraints 
on the L{p, 0) labelling, in the sense that any edge (m, v) of G), indicate that 
labels u and v can be assigned to neighboring nodes in Hd- We want to find an 
homomorphism H from Hd to Gp, i.e. we want to find a mapping from V{Hd) 
to V (G'), where every node v has an image h{v) such that any edge {u, v) in Hd 
corresponds to an edge {h{u), h{v)) in Gp. If we can do this, then we can find a 
labelling (more precisely, c{v) = h{v) for any node v) that satisfies the L{p,0) 
constraints. Furthermore as this labelling has to be no-hole, we also need that 
every node of Gp is an image of at least one node of Hd. Let us partition the 
nodes of Hd into d + 1 sets: for any 0 < f < d, the set Si corresponds to the nodes 
having i bits equal to 0 in its binary coordinates. By definition of the hypercube, 
for every 0 < i < d, S'i is a stable set. In other words, all edges appear between 
different SiS. More precisely, all the edges of Hd appear between an Si and an 
S'i+i. Let us define the homomorphism so that all nodes belonging to the same 
Si have the same image by H. Let hi be the image by H of all the nodes of 
Si. Then, for any l<j<d— 1, must be connected in Gp to both di+i and 
hi-i. Moreover, /iq must be connected to hi, and hd-i must be connected to hd. 
Hence, this induces a path starting at ho, and ending at hd, with edges (hi, hi^i) 
for any 0 < i < d — 1. But we also want this labelling to be no-hole, hence this 
path must be hamiltonian. In other words, if we are able to find a hamiltonian 
path in Gp, then there exists a homomorphism of Hd into Gp. Clearly, since we 
have d -|- i sets Si, and since each one has a unique image in Gp, we must have 
d -I- 1 > 2p, that is d > 2p — 1 . 

Finally we need to show that Gp contains a hamiltonian path ; it is as follows: 
p, 0,p-|-l, 1, . . . , i,p+i, i+l,p+i+l, . . .,p—2, 2p—2,p—l, 2p—l (cf. Figure 2(left)). 

Let Vj be any node of set Sj, 0 < j < d. If j = 2i, we set h{v 2 i) = p + i for 
every 0 < i < p — 1 and if j = 2i + 1, we set h{v 2 i+i) = * for every 0 < i < p — 1. 
Finally, for any j > 2p, if j is of the form 2p -|- 2i, we set h{vj) = 2p — 1, and if 
j is of the form 2p -|- 2z -|- 1, we set h{vj) = 0. 

We can show that the above result can be extended for any d > p -I- 1. This 
is obtained using the same kind of argument (that is, homomorphism into G'p), 
but with a better mapping of the nodes (cf. Figure 2(right)). 

The same goes to extending the result for any d > hence proving the 
proposition. Here again, the proof technique is the same as for the two previous 
cases. Roughly speaking, we consider Hd as 4 copies of idd- 2 , connected between 
them by 2 perfect matchings. □ 
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Fig. 2. (left) Homomorphism of Hd into G'p, with d = 7 and p = 3 ; (right) Another 
homomorphism of Hd into Gp, where nodes are represented by squares 



Observation 2 For any p >2, if ^^{Fld) = 2p — then p <2‘^ ^ — 

Proof. Suppose that a no-hole L{p,0) labelling of exists, with = 

2p — 1. Since Hd has 2‘^ vertices, we must have 2'^ > 2p. Since p > 2, labels 
0,p — l,p and 2p — 1 are pairwise distinct. Moreover, by definition, there must 
exist a vertex labelled p, whose d neighbours must then be labelled 0. The same 
goes for any vertex labelled p — 1, whose d neighbours must then be labelled 
2p— 1. Hence, those four pairwise different labels are present on at least 2{d+ 1) 
vertices, leaving 2p — 4 labels to be present on at most 2^ — 2{d + 1) vertices. 
Thus, we must have 2p — 4 < 2^ — 2{d + 1), which gives the result. 

Proposition 4. If v^(Hdo) = 2p — 1 for a given dimension dg, then Vg(Hd') = 
2p — 1 for any d' > do. 

Proof. Consider Hdg , for which we have a no-hole L{p, 0) labelling with Vq {Hdf)= 
2p — 1. We will show here a way to obtain a no-hole L{p, 0) labelling of Hdg+i 
from the labelling of Hdg. We recall that Hdg+i is obtained from two copies of 
Hdg joined by a perfect matching. Now consider a copy of Hdg, having a no-hole 
L{p, 0) labelling with VQ{Hdg) = 2p—l: necessarily, there must exist a vertex in 
Hdg, say X, whose label is p. Wlog (since hypercubes are vertex transitive), let 
the binary coordinates of x be as follows: x = (0, 0 ... 0). Since i^oi^dg) = 2p — 1, 
all the neighbours of x must be labelled 0. Moreover, all the neighbours of x have 
exactly one bit equal to 1. By the same argument, we can see that all the vertices 
having 2 bits equal to 1 must be labelled in the range [p; 2p — 1], while all the 
vertices having 3 bits equal to 1 must be labelled in the range [0;p — 1]. More 
generally, all the vertices having an even (resp. odd) number of bits equal to 1 are 
labelled in the range [p; 2p— 1] (resp. [0;p— 1]). Now, take a second copy of Hdg, 
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and label each vertex having an even (resp. odd) number of bits equal to 1 with 
label 0 (resp. 2p— 1). This labelling is an L{p, 0) labelling, but it is not no-hole. 
If we connect the corresponding vertices in both copies of Hdo , the labelling we 
obtain remains L{p,0) (vertices labelled in the range [p;2p— 1] (resp. [0;p— 1]) 
in the first copy are connected to vertices labelled 0 (resp. 2p — 1) in the second 
copy). Moreover, since it is no-hole in the first copy of Hdg, it remains no-hole 
in Hd„+i. 

Proposition 5 (2-Dimensional grids x Pm)- For any p and n > m > 1 

we have: r'o(Pn x Pm) = 2p — 1, where 

1. n ■ m — m + 1 > 2p if n is even and m is odd, 

2. n ■ m — m >2p otherwise. 

Sketch of Proof: Fill in the x Pm grid (i.e. n rows and m columns) in 
the chessboard mode. Like in the chessboard where we have white and black 
alternating squares, we have in the “white” squares the labels from the range 
[p; 2p—l] and in the “black” squares the labels [0; p— 1] . Without loss of generality 
assume that the left upper square is white. Take the following labelling: put p 
in the left upper corner and subsequently put in the white squares from left to 
right and row by row the upper range labels: p + l,p + 2, ...,2p — 1. In the last 
row put in all white squares 2p — 1. Further put 0 into all “black” squares of 
the first row of the grid. Starting with the left most square in the second row of 
the grid, we subsequently put into the “black” squares lower range labels from 
[0; p — 1]. The labelling is no-hole L(p, 0) and 2p = m{n — 1) -|- 1 if to is odd and 
n is even. Otherwise 2p = m{n — 1). □ 

Below is an example of 2-dimensional grids G(5,6) and G(5,5). 
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Fig. 3. No-hole L(p, 0) labelling in 2D grids G(n,m): (left) m = 5,n = 6,p = 13; 
(right) m = n = 5,p — 10 



Proposition 6 (Consequence of Proposition 5). For anyp, q > 0 and d> 4 

we have: Ug^Hd) = 2p — 1 when 

1. {\d/2\ +l)\d/2'\ +2>2p if d is odd, 

2. {d/2 + 1)^ — d/2 >2p if d is even. 



Proof. Proof follows by combining Propositions 3 and 5. 
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Remark 1. Similar results to Propositions 5 and 6 can be obtained for the 3 and 
higher dimensional grids. The results will appear in the full version of this work. 
Below is an example of a no-hole labelling of a 3-dimensional grid (see Figure 4). 
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Fig. 4 . A L(24, 0 ) no-hole labelling in P4X P4X P4 with = 47. Each of the 4 blocks 
represents a 2-Dimensional subgrid P4 x P4, in which each square represents a node 



By direct application of Proposition 1, we get the following result for the 
2-dimensional toroidal meshes. 

Proposition 7 (Tori). For any p>l and n,m> 3.' 

— r'Q(C„ X Cm) > 2p — 1 if n and m are both even 

— ixQ{Cn X Cm) > 2p otherwise 




Fig. 5. L(p, 0) no-hole labelling in 2D-tori; an example for C 4 x C 4 where p = 4 



3 L{p, q) Labellings of Hypercubes 

We conclude this paper by giving new results concerning the (general, i.e. not 
necessarily no-hole) L{p, q) labelling of Some results on this topic have been 
given in [7]. However, it is possible to improve them. This is the purpose of 
Proposition 8 below. 

Proposition 8. For any p,q,>0 and d> 1: 

1. Xl{Hd)=p 

2. (d-l)g<A«(iLd)<(2d-3)g 
3- XP{Hd)<2p+{2d-2)q-l. 
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Proof. (1) For any d > 1, Hd has at least two vertices. Consider a vertex v 
of Hd that is assigned label 0 in a L(p, 0) labelling of Hd- Snch a vertex mnst 
exist, otherwise every label conld be decreased by at least 1, leading to a better 
solntion. v has at least one neighbonr w, whose label mnst then be greater than 
or eqnal to p. Hence, v^^Hd) > p. We can show that v^^Hd) < p hy noticing 
that Hd is bipartite. Thns, if all the vertices of the first (resp. second) partition 
of Hd are labelled 0 (resp. p), the labelling we obtain is L{p, 0). 

(2) Take any vertex u of Hd- It has d neighbors, all of them lying at distance 
2 from each other. Hence, those d vertices mnst be assigned labels that differ by 
at least q. Since labels can begin at 0, it follows that the greatest label is greater 
than or eqnal to {d — l)g, showing that X^{Hd) > {d — l)q. The npper bound 
is obtained by the following labelling: suppose each node v = {xi,X 2 ■ ■ -Xd) is 
defined by its (binary) coordinates in each of the d dimensions of Hd ; for any 
vertex v = {xi,X 2 ■ ■ - Xd) of Gd, we define c{v) = (X)fc=i kqxk) mod (2d — 2)q. 
Since p = 0, we only need to consider two vertices u and v lying at distance 
2 in Gd, thus differing on two coordinates, say Xi and Xj, 1 < i j < d. We 
will consider two cases here: (a) j = d and (b) j d. In case (a), we have 
\c{u) — c(n)| = iq mod {2d — 2)q, and since 1 < i < d — 1, |c(m) — c(n)| > q. In 
case (b), we either have \c{u) — c(n)| = {i + j)q mod (2d — 2)q or \c{u) — c(n)| = 
(j ~ *)? mod {2d — 2)q, but since 1 < i < d — 2 and i + 1 < j < d — 1, 
\c{u) - c(n)| > q. 

(3) Suppose each node v = {xi,X 2 ■ ■ ■ Xd) is defined by its (binary) coordinates 

in each of the d dimensions of Hd- Every node v = (a;i, X 2 ■ ■ ■ Xd) is then assigned 
label c{v) = + (* ~ l)?)^^! mod {2p + (2d — 2)q). Take two neighbors 

u and V in Hd, which thus differ on exactly one coordinate, say in Xj. Thus 
|c(n) — c(u)| = p+{j — l)q mod (2p+(2d — 2)g), that is |c(n) — c(u)| = p+ {j — l)q. 
Since 1 < j < d, we have that |c(n) — c{u) \ > p. Now consider two nodes differing 
on two coordinates i and j, where, wlog, i < j (since we are in Hd, necessarily 
i ^ j). We have two cases here: (a) Xi = Xj or (b) Xi Xj. In case (a), we obtain 
that |c(n) — c(m)| = 2p + {i + j — 2)q, which is clearly greater than or equal to 
q. In case (b), |c(n) — c{u)\ = {j — i)q, which is also greater than or equal to q 
since j ^ i. 

Remark 2. 

— Concerning the L(0, q) labelling of Hd, we can show that \^^{H^) = A°(id 4 ) = 

3g ; 

— We note that Proposition 8(3), when applied to the case p = 2 and 9=1, 
gives XP{Hd) < 2d+ 1, a value which coincides with the lower bound proved 
in [8]. 



4 Conclusion 

In this paper, we have mainly considered the no-hole L{p, 0) labelling in different 
topologies, such as cycles, hypercubes and 2-dimensional tori. We also gave some 
bounds for the (not necessarily no-hole) L{p, q) labelling of hypercubes, that 
improve the ones from [7]. 
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Concerning no-hole L(p,0) labellings, we have mainly based our study on 
the cases for which there exists a no-hole L{p,0) labelling having the minimum 
number of labels (that is, minimum as stated in Proposition 1). Indeed, depend- 
ing on the respective values of n (number of nodes of the considered network) 
and p, such a no-hole L{p, 0) labelling might not exist. Also, as remarked above, 
our work concerning 2-dimensional grids can be extended to any d-dimensional 
grids, d > 3 (this will appear in the full version of this work) . We also note that a 
natural extension of this work is to study the no-hole L{p, q) labelling of graphs, 
for any p, q. 
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Abstract. The problem of routing traffic through a congested network 
is studied. The framework is that introduced by Koutsoupias and Pa- 
padimitriou where the network is constituted by m parallel links, each 
having a hnite capacity, and there are n selfish (noncooperative) agents 
wishing to route their traffic through one of these links: thus the problem 
sets naturally in the context of noncooperative games. Given the lack of 
coordination among the agents in large networks, much effort has been 
lavished in the framework of mixed Nash equilibria where the agent’s 
routing choices are regulated by probability distributions, one for each 
agent, which let the system reach thus a stochastic steady state from 
which no agent is willing to unilaterally deviate. Recently Mavronicolas 
and Spirakis have investigated fully mixed equilibria, where agents have 
all non zero probabilities to route their traffics on the links. In this work 
we concentrate on constrained situations where some agents are forbid- 
den (have probability zero) to route their traffic on some links: in this 
case we show that at most one Nash equilibrium may exist and we give 
necessary and sufficient conditions on its existence; the conditions relat- 
ing the traffic load of the agents. We also study a dynamic behaviour 
of the network, establishing under which conditions the network is still 
in equilibrium when some of the constraints are removed. Although this 
paper covers only some specihc subclasses of the general problem, the 
conditions found are all effective in the sense that given a set of yes/no 
routing constraints on each link for each agent, we provide the probabil- 
ity distributions that achieve the unique Nash equilibrium associated to 
the constraints (if it exists). 



1 Introduction 

We study the problem of routing traffic in a simple context proposed by Kout- 
soupias and Papadimitriou in [11]. We are given a network of m parallel links, 
each having a finite capacity (traffic rate) of carrying the traffic w\,. . . , Wn of 
n agents. Each agent has an indivisible amount of traffic that wishes to route 
through a link and it has a delay given by the overall traffic routed by all the 
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other agents on that link at the given link rate. In large networks, it is reasonable 
to assume a selfish behaviour of the agents, that is a lack of coordination^ (see 
[11, 17]). Intuitively, each agent aims at minimizing the delay of its own traffic. 
Thus the problem sets naturally in a context of noncooperative games where the 
routes chosen by the users form a so called mixed Nash equilibrium, in the sense 
of classical game theory: a stochastic steady state of the game ([15, 16]). A prob- 
ability distribution to route the traffic of an agent through the links, is called 
a strategy for that agent. Given a set of n strategies, the network converges to 
an equilibrium point (steady state) such that each agent has no convenience to 
modify unilaterally its routing strategy: in game theory this is called a Nash 
equilibrium (see [15, Chapter 3] for deep interpretations of mixed strategy Nash 
Equilibrium and [14]). Since Nash equilibria often lead to a performance degra- 
dation of the network, in [11] (where the authors introduce also the idea to 
calculate Nash Equilibrium as a solution of linear equation system, see also [12]) 
it has been proposed to measure this degradation in terms of coordination ratio: 
the ratio between the worst possible Nash equilibrium and the value of the op- 
timum performance. Remarkable results have been given in [3, 10] establishing 
asymptotically tight bounds on the coordination ratio and also in [1] where a 
model similar to ours, so called the subset model, is considered. Anyway since, 
as remarked also in [16], it is not known any algorithm to find a Nash equilib- 
rium given a network of links and agents, in [11] it has been noticed that if we 
are given a set of n supports (a vector of m binary variables indicating whether 
an agent routes its traffic with non zero probability or not on each of the m 
links) then at most one Nash equilibrium exists and this can be computed in 
POLY {mn) time. In [13] necessary and sufficient conditions on the existence 
of fully-mixed Nash equilibria have been given, that is the case in which each 
agent has non zero probability to route its traffic on every link (to note that 
the conditions can be computed in Many other interesting results have 

been published: some of them (see [8, 9, 17]) concern cases of pure or fully-mixed 
strategies: either an agent has probability 1 to route its traffic on a particular 
link and probability 0 on all other links or it has non zero probability, different 
from 1, to use every link (in this case all supports are identical); the paper [4] 
concerns with the time the system takes to converge to a pure Nash Equilibrium, 
while [5] deals with the problem of converting any given non-equilibrium routing 
into a Nash equilibrium without increasing some measures of costs (see also [7] 
where similar results have been given for the model of unrelated links where each 
user is only allowed to ship its traffic on a subset of allowed links) . 

In this paper we aim at establishing conditions on the existence of Nash 
equilibria in presence of mixed supports. Actually our results fits in a setting of 
constrained routing: given a set of n supports, one for each agent, we investigate 
the case when an agent is forbidden to route its traffic on some links. This means 
that in the case of Nash equilibrium the game does not take into account the 



^ Note that, even in the case when all the links have equal capacity, the problem to 
minimize the maximum delay of the agents is an NP-complete problem (reduction 
from the Partition problem). 
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agent’s cost of routing on the forbidden link. To make more clear the description 
and motivation of our results, let us consider an equivalent view of the network: 
each agent is connected to m sources and each source has a private link with 
a single destination. With this setting in mind, we address situations that may 
arise from a total congestion or a fault of some link between agents and sources, 
thus forbidding some of them to route their traffic through these links. The 
question we wish to answer here is: “under which conditions are there Nash 
equilibria when some agents are constrained not to route their traffic on some 
links?” We point out that given a network and a set of constraints, at most 
one Nash Equilibrium which fulfills this set of constraints may exist. We give 
necessary and sufficient conditions on the existence of equilibrium, relating the 
traffic weights and we also show which are the strategies of the agents, yielding 
thus the conditions effective and computable in POLY{mn). More precisely, 
given a set of constraints describing on which links the agents may or may not 
route their traffic, we provide conditions for the existence of a Nash equilibrium 
providing also the unique strategy for each agent associated to the constraints. 
Moreover since the number of equilibria is, in general, exponential in m and n 
(the number of possible supports), our results are given for classes of constraints 
parameterized in the number of agents that will never route traffic on a link and 
in the number of links which will never carry traffic of some agents. 

We also study a dynamic behaviour of the network: given n strategies, one 
for each agent, yielding a Nash equilibrium fulfilling a given set of constraints, 
we establish under which conditions the strategies are still a Nash equilibrium 
when the set of constraints changes. In particular we investigate the case when 
some of the constraints are removed and the agents are free to route their traffic 
(for a more extensive discussion of our results we refer the reader to Section 1.1). 

Organization of the Paper. The paper is mostly self-contained and is orga- 
nized as follows: in the next subsection we give some preliminary definitions and 
list our results. In Section 2, we first introduce some notation used throughout 
the paper and then give two preliminary results that will be exploited in Sec- 
tion 3 to show the necessary and sufficient conditions for the existence of Nash 
equilibria when the agents route their traffic subject to the constraints and when 
these are changed. Finally we give some short conclusions in Section 4. 



1.1 The Model and Our Results 

In this subsection we give some preliminary definitions, mostly following the 
notation of [3] and [11]. For all natural numbers x, y, [x, y] denotes the set {x, x + 
l,...,j/}, when a: = 1 we simply use [yj. Consider a network N constituted 
by m parallel links, each having capacity respectively, on which n 

agents wish to route their unsplittable traffic, of length wi, . . . ,Wn, respectively. 
If Si = ... = s„, the system is called uniform. In what follows, m and n 
will always be the number of links and the number of agents in the network, 
respectively. A strategy Pi for an agent i, is a probability distribution P/, 
j G [m], on the set of the m links. In particular a pure strategy is a strategy 
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such that P/ G {0, 1} (that is agent i sends its traffic on a unique link with 
probability 1) and a fully mixed strategy Pi is a strategy such that P/ G (0, 1), 
for all j G [m]. A mixed set of strategies P = {Pi,...,P„} is a set of 
n strategies (one for each agent). Let us remark that the term mixed is used 
to underline that a set can contain strategies that are pure, fully mixed and 
strategies that are none of these two kinds, as well. In fact, as said above, almost 
all known results are for either pure or fully mixed strategies, while the present 
paper deals with solutions of mixed sets of strategies. A support Si associated 
to a strategy Pi is a set of indicator (binary) variables Si with j G [m] such 
that Sf = 0 if and only if P^ = 0. A set of supports S = {S'!, . . . , 5'„} is a set 
of n supports (one for each agent). The expected cost of agent i on link j is 

4 = {'Ek^iPkWk + Wi)/sj. 

Our scenario, with the constraints on the routing of agents, is the following: 
a constraint Fi associated to agent i is a set of indicator (binary) variables F( 
with j G [to] such that P/ = 0 if and only if agent i cannot (is forbidden to) route 
its traffic on link j. A set of constraints (one for each agent) F = {Pi, . . . , P„} 
is a set of n constraints. A strategy Pi is compatible with a constraint P if 
whenever P/ = 0 then P/ = 0, for j G [to]. Similarly a support Si is compatible 
with a constraint P if whenever P/ = 0 then Si = 0, for j G [to]. In this setting, 
we can say that a set of strategies P is a Nash equilibrium if 

P/ > 0 implies that c{ < c\ i G [n] and j, I G [to] s.t. Si = 1 and P{ = 1. (1) 

Informally, a mixed set of strategies P = |Pi,...,P„} on a network N, is a 
Nash equilibrium if there is no incentive for any agent to modify unilaterally 
its strategy. From this it follows that in a Nash equilibrium the costs of an 
agent i G [n] are all equal on all links j G [to] such that P/ > 0. We can now 
define our problem. 

Definition 1. Given a network N , a set of constraints F and a set of supports 

5 compatible with F, Route(N, P, S) problem is the problem of finding a Nash 
equilibrium P such that S is associated to P. 

In the rest of the paper, except where otherwise stated, we assume that a 
Route(N, F, S ) problem is for uniform networks, that is a network with si = 
. . . = Sm- Hence a solution to a Route(N, F,S) problem is given by solving the 
following inequality system (if solution exist): 

{ d < c( y i G [n], j, I G [to] s.t. si = 1 and P/ = 1 

P/=0 ViG[n], jG[to] s.t. . 

p/ > 0 y i G [n], j G [to] s.t. si = 1 

ET=,Pi = iyiG[n] 

Equivalently and more practically a solution to the Route (N, F, S) problem can 
be computed by first solving the following system of to n equations in to n 
unknowns: 
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( d = c[ V i e [n], G [to] s.t. si = 1 and S\ = 1 

\pi = 0 V z e [n], j G [to] s.t. si = 0 (3) 

lEr=i^' = iv*e[n] 

and then by simply verifying whether the given solution is a probability distri- 
bution inducing a Nash equilibrium satisfying condition (1). From this it follows 
that Route(N, F,S) problem has at most one solution. 

We now define the case in which for a given link there exists only one agent 
with non zero probability on this link. Consider a Route(N, F, S) problem, non 
necessarily uniform, and a mixed set of strategies. A link j is an s-link if and 
only if there exists i G [n] such that P/ > 0 and = 0 for all k ^ i (in [13] 
this is called a solo link). The agent i is called s-agent for the s-link j. 

Consider a Route (N, F,S) problem (not necessarily uniform) and a strategy 
Pi for an agent z G [n]. In order to treat mixed sets of strategies we introduce 
two measures: a set of supports S is called a /3-set of supports if and only if all 
the links on which the agents do not route their traffic is exactly /?. A (a, [3)-set 
of supports is a [3-set of supports in which there are a supports that induce 
1-strategies and (rz — a) supports that induce fully-mixed strategies. 

Given two sets of constraints, F and H, we introduce the following partial 
order relation on them: F < FI li F^ < Flf, for all z G [zz],j G [to]. 

Our Results. In Section 2 we consider a Route(N, F, S) problem, not necessarily 
uniform, with an s-link and give a necessary condition for the existence of a Nash 
equilibrium relating the capacity of the links to the traffic weights of the agents. 
From this result we obtain, as a corollary, a result presented in [6]. Then we 
present a lemma showing that, given a Nash equilibrium associated to a set of 
supports, if some agents are forbidden to route their traffic on a pair of links, 
then the remaining agents have the same probabilities to route their traffic on 
these two links. In Section 3 we give effective necessary and sufficient conditions 
for the existence of Nash equilibria in the case of (a, /3)-set of supports, first in 
the case of /3 = 1 and 1 < a < (n — 2) and then for a = [3, for a given set 
of constraints F. In these cases we show also that if the network constraints 
change and we are given a new set of constraints H, such that F ^ H , then the 
conditions above still hold to get the same equilibrium. 

2 Notation and Preliminary Results 

Given a Route(N, F, S) problem (not necessarily uniform), we will adopt the 
following notation: 

— a is the number of agents with neither pure nor fully-mixed strategies; 

— Li = {j G [to] I S[ = 0}, i.e. a link j is in Li if agent z does not route its 

traffic on it. L = and /? = \L\. 

— for any set of agents G, and are, respectively, the minimum and 

the maximum weight of jobs of agents in G. Moreover, Tq is the sum of the 
jobs of the agents in the set G. 
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— for any set of agents G, , where l,h G {0, • • • , |G|}, denotes the set obtained 
from G by deleting the agents with I lowest jobs and h highest jobs. 

— = {i e [n] I Sf = 0}, contains the set of agents that do not route their 
traffic on link j. [A^] = {I ^ [m] \ Sj = 0 if and only if S'- =0, for all i G 
[n]}, i.e. [A^] is an equivalence class containing links on which the same set 
of agents do not route their traffic. A = Uj^[rH]A^ and a = \A\. 

Now we give two results which will be used in the next section. The first is an 
(effective) necessary condition for the existence of a solution to the Route(N, F,S) 
problem such that S induces at least one s-link. The second result is a technical 
lemma which establishes when it is possible to reduce the dimension of sys- 
tem (3). Actually the so obtained system of equations, will be the starting point 
to describe (and compute) the solutions to the route problems presented in the 
rest of the paper. 



Proposition 1. Consider a Route(N, T', S') problem (not necessarily uniform) 
and a mixed set of strategies P associated to S. If there exists an s-agent ii for 



an s-link I and a link j A- I such that S( = 1 and — yf 
the set of strategies P is not a Nash equilibrium. 






— h 1, then 



Proof. From definition of expected cost of an agent, it results that = Wijsi. 
Therefore in a Nash equilibrium it must hold = c\^ for all j G [m] such that 
P( > 0. That is Pt^t + w^)/sj = WiJ si for all j G [m] such that P(^ > 0, 

thus yielding Pt'^t)/wii + 1 = Sj/si for all j such that P/^ > 0 which 

completes the proof. □ 



This proposition in fact can be specialized to the case of uniform links, getting 
so the following corollary (which resembles Proposition 1 of [6]). 

Corollary 1. Consider a uniform Route(fV, P, S) problem. If there exists an s- 
agent f for an s-link I such that a link j yf I and an agent k ^ ii exist with 
S( = Si = 1, then there is no solution for the Route(fV, P, S) problem. 

Given a set of supports S = {Si, . . . , S„}, let us abuse in the notation (for 
clarity of exposition) and treat S and each Si as ordered vectors, thus S can 
be seen as a boolean matrix with columns SCj G [m\. The following lemma 
establishes that given a Route(N,F,S) problem if S^ = S\ j,l G [m] and a 
set of probability distributions {Pi) exists inducing a Nash equilibrium, then 
P( = Pj for all z G [n]. In words, when agents i\,. . . ,ik are forbidden to route 
their traffic on a pair of links j and I then, in a situation of Nash equilibrium, 
the remaining agents have all equal probabilities to route their traffic on links j 
and 1. 



Lemma 1. Consider a Route(N, P, S) problem and let Pi = {P(, . . . , P/"} be a 
Nash equilibrium associated to S, for all z G [n]. Then, given two links I ^ j G 
[Ai] = [A^] it holds that P/ = P( for all z G [n]. 
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Proof. The proof is by contradiction. Let l,j S [A^] and assume, by way of 
contradiction, that agents i with Pf P^ exist (without loss of generality 
let i G [t] for some t > 0). Thus P/ yf P/ for 1 < i t and P/ = P/ for 
(t + 1) < i < n. Moreover, let 1, . . . , z be the agents with non zero proba- 
bilities on links I and j. The proof for t = 0 and for t = z is trivial. Thus 
consider the case 1 < t < z and let P^ < P/, for i G [t]. Because Pf is 
a Nash equilibrium, then c\ = must hold. Anyway from the assumptions 
T,k=t+iHwk = J2k=t+iHwk, thus J2l=i,k^iPtwk = Y.l=i,k^,Plwk and 
since P‘ < P( then J2k=i Pk^k < J2k=i Pk'^^k, yielding that c[ < cl for all 
t < r < z, which is the desired contradiction. □ 

From this lemma, we can show an equivalent reduced system of linear equa- 
tions. Without loss of generality, we assume that if there are v different classes 
of equivalence, then [A^] yf [A^] yf ... yf [A^\. In other words, links 1, . . . ,?; 
are all in different classes of equivalence. 

{4-c[ = Q y j G[v],(iG r\t=j+i H -L,- {j} 

\pi = Pl yiG[v], jG[A^]-{ 1}, t^A^ ( 4 ) 

I P/ = 1 V f G [n] 

It easy to see that the solutions of this system have to be necessarily greater 
than zero in a Nash equilibrium. 

3 Nash Equilibria 

Consider a uniform Route(N, F, S) problem. In this section we will establish 
effective necessary and sufficient conditions on the existence of a solution to 
the problem. As said above a necessary condition is that the system ( 4 ) has 
all solutions in (0, 1]. Moreover, we consider the case in which some or all of 
the constraints are removed (a Route (N, H,S) problem with F < H) and show 
sufficient conditions for the solution of the Route (N, F,S) problem are a solution 
for the Route(N, H,S) too. In particular, we study {a, /3)-sets of supports in some 
particular cases. For simplicity, we assume in the rest of paper, that the a agents 
with zero probabilities are the agents 1, . . . , a, that is A = {!,...,«} and the 
links with zero probabilities are the links 1, . . . /3, that is L = {1, . . . , /?}. In the 
following proofs, we make use of solutions of the system ( 4 ) which have been 
calculated using the software Matlab© to found the base cases and then by 
guessing the solution for the generalized problem. 

3.1 (a, 1)-Sets of Supports, 1 < a < n — 2 

All the a agents are forbidden to route the traffic on the first link. 

Theorem 1. Consider a Route(A^, P, S) problem such that S is an (a, l)-set of 
supports with I < a < (n — 2) and F = S. A solution to Route(A^, P, S) exists if 
and only if 
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T[a] < (to - 1) [n - (a + ■ 

In this case the Nash probabilities are 



P = 



0 

1 

m — 1 

[n— (a+l)]iOi+T[c,] 

/^[n— (o:+l)]iL»j 
(m— l)[n— — T[ q,] 
m{m—l)[n—{cx-\-l)]wi 



l = 1 and i G [a] 

I G [2, to] and i G [a] 

I = 1 and i G [a+ l,n] 

I G [2, to] and i G [a + l,n] 



( 5 ) 



( 6 ) 



Proof. In this case the system (4) has the values P/ given by (6) as solutions. 
As TO > 1, Pf G (0, 1) for all i G [a + 1, n] and obviously P^ G (0, 1) for i G [a]. 
Moreover, as Pj = for i G [a + 1, n] and I G [2, to], then Pj G (0, 1) if and 
only if P/ G (0, 1) for i G [a+ 1, n] and I G [2, to]. Therefore, from equations (6), 
the P/ are a set of strategies if 



— [n — (a + l)]wi < P[q] < (to — l)[n — (a + l)]wi V t G [n] — [a]. 

These (n — a) inequalities can be equivalently verified by just verifying the 
inequality P[q,] < (to— l)[n— + which gives the necessary condition. 

The sufficient condition is trivial. Indeed, it is easy to see that if the rela- 
tion (5) holds, then Pf G (0, 1) for alH G [a -I- l,n] and therefore the P/ are a 
set of strategies. From this, as the P/ are solutions of the system (4), they are a 
Nash equilibrium. □ 



Removing some constraints we can get the following theorem. 

Theorem 2. Consider a Route(A^, P, S) problem such that S is an (a, l)-set of 
supports with I < a < {n — 2) and F = S, and a Route(N, H, S) such that 
F ^ H. If the solution to the Route(A^, P, S') exists, then also the solution to the 
Route(A^, P, S) exists. The Nash probabilities for the Route(fV, P, S) are those 
given in (6). 



Proof. If a solution to the Route(N, F,S) exists, then the relation (5) holds and 
obviously P = {P/} is a set of strategies then it remains only to prove that it is a 
Nash equilibrium. Since for the agents i G [n] such that P/ = 0 or z G [a+ l,n], 
P/ > 0 for all j G [to] such that Hf = 1, let us concentrate on the agents z such 
that Pf = 0 and Hf = 1. For these agents it results: 



= — V 

m L^k>OL 

= ^ J 2 k>a wk 



I {n—oc-\-mn—ocm—m)v 
m[n-(a-|-l)] 



(n-a) 



k<cx ,k^i 



{m[n— (a+1)] — n+a} 



m[n— (a+1)] 



and 



m(m — l)[n— ( q+1)] 



k<cx ,k^i 



{m(m— 1) [n— ( q:+ 1)] — n+o:}t 
m(m— 1) [n— (cn+l)] 



for I G [to] /[I], from which c( — c) 



E wi.-l-(n—a)wi 
k<a,k^i ^ ^ ^ 

(m-l)[n-(a+l)\ 



< 0 for all I > 1, 



therefore the set of strategies P yield a Nash equilibrium for the Route(N, P, S ). 



□ 
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3.2 (/3,/3)-Sets of Supports, (3 — m > 2 

Agent i is forbidden to route its traffic on link z, z G [a] and a = [3 = m. 

Theorem 3. Consider a Route(-/V, F, S') problem such that S is a {(3, (3) -set of 
supports, n>m = /3 = a>2 and F = S. A solution to Route(A^, F, S) exists if 
and only if: 

T[-m]l > (to - 2)wI^L - (to - 2)[(to - l)n - {2m - 

T[m]o < (to - 2)zc|™'„ + [(to - l)n - (2 to - l)]w^Zt 

> (to - - (to - 1)[(to - l)n - (2 to - 

^[m]0 < (to - + [(to - l)n - (2 to - 

In this case the Nash probabilities are: 



P = 



0 

(m— i)i(™ — l)n—(2m—l)]wi 
[(m — l)n— (2m — l)]it;i + (m — 

m[(m — l)n— (2m — l)]iL»j 



z G [to] and I = i 
i G [to] and I G [to]/{z} 

I G [to] and z G [to + 1, zz] 



(8) 



Proof. In this case the system (4) has the values P/ given by (8) as solutions. 
A necessary condition for the existence of a Nash equilibrium is that system (4) 
has solutions in (0, 1]. In particular, as to > 1, P/ G (0, 1) for all z G [a + l,n], 

i.e. 

fO<Pj*<lVzG [to], I G [to]/{z} 

0 < Pj* < 1 V z G [to + 1, zz], ? G [to] 



must hold, that is: for z G [to] and I G [m]/{i\ 



f J2k<m,k^i,i Wk> {m- 2)wi - (to - 2) [(to - l)zz - (2zzz - l)]wi 
I Wk<{m- 2)wi + [(to - l)zz - (2zzz - l)]wi 

and for z G [to + 1, zz] and I G [to] 



f Y.k<m,k^i Wk> {m- l)w/ - (to - 1)[(to - l)zz - (2 to - l)]zc* 

1 J2k<m,k^i Wk<{m- l)w/ + [(to - l)zz - {2m - l)]wi 

Now it is easy to see that to verify the first to(to — 1) inequalities is equivalent 
to verify just the following inequality 

T[m]l > (to - 2)zc|;([]„ - (to - 2) [(to - l)zz - (2 to - l)]zc|™'„ 
since [(to — l)zz — {2m — 1)] > 1. 

Analogously, the others inequalities can be verified by testing the following 
three ones: 

T[m]° < (to - 2)zc|™'„ + [(to - l)zz - (2zzz - l)]u}^}f 
T[m]l > (to - l)zc|rL - (to - 1)[(to - l)zz - (2zzz - 
T[m]° < (to - + [(to - l)zz - (2zzz - l)]w|™n^’"’ 
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This completes the proof of the necessary condition. 

The sufficient condition is trivial. In fact, it is easy to see that if relations (7) 
are verified, then 0 < P/ < 1 for all i G [n] and I G [m]/{i} and, therefore, the 
P/ are a set of strategies. From this, as the P/ are solutions of the system (4), 
they are a Nash equilibrium. □ 



Removing some constraints we can get the following theorem. 



Theorem 4. Consider a Route(N, P, S) problem such that S is an (/?, P)-set of 
supports with n>m = f3=a>2 and F = S, and a Route(N, H, S) such that 
F ^ H . If the solution to the Route(N, P, S) exists and 

T[m]o < {{m - l)n - (9) 

then also the solution to the Route(N, H, S) exists. The Nash probabilities for 
the Route(N, P, S') are those given in (8). 



Proof. If a solution to the Route(N, F,S) exists, then the relation (7) hold and, 
obviously, P = {P/} is a set of strategies. From this, it remains only to prove 
that this set of strategies gives a Nash equilibrium. Since for all the agents i G [n] 
such that P* = 0 or f G [/3 + 1, n], P/ > 0 for all j G [m] such that P* = 1 let 
us concentrate on the agents i such that F- = 0 and P* = 1. For these agents it 
results: 



c 



I 

i 




k>m 



l)n 



= — h W. 

m[{m — l)n — (2m — 1)] 




[(m 

k<m,k^i 




k>m 



l)n — 



l)n - (2m - l)]wfc + (m - 2)wj - ^.s 

(m — l)[(m — l)n — (2m — 1)] 

(2m - l)]wfc + (m - l)wi - Y.s<m,s^i 
m[(m — l)n — (2m — 1)] 



^ ^ [{m 

k<m,k^i,l 



l)n 



(2m - l)]wfc + (m - 2)wi - Y.s<m,s^k,i 
(jn — l)[(m — l)n — (2m — 1)] 



, \(m—l)n—m]wi—y , 

from which c( — d = , 



(m-i)[(ra-i)n-(2ra-t)] < « ^s the (9) holds, therefore 

the set of strategies P give a Nash equilibrium for the Route(N, H, S) , which 
completes the proof of theorem. □ 



3.3 (/3,/3)-Sets of Supports, /3 < m 

Each agent i is forbidden to route the traffic on link i, i G [a] and a = j3 < m. 

Theorem 5. Consider a Route(fV, P, S') problem such that S is a {(3, (3) -set of 
supports with m > (3 and n > m > 2 and F = S. A solution to Route(fV, P, S) 
exists if and only if, said 

A = n — 2 B = {m— l)n — (2m — P) C = {m — 2)n — (2m — (/3 + 2)) 

D = n— 1 E = \m — l)n — (2m — 1) F = \m — l)n — (2m — (/? + 1)) 
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the following inequalities hold V fc = 1, . . . , /3 

vzG[/?]/w 

0 < [f3+l,n] and I € [4 ^ 

and the Nash probabilities are: 



P = < 



0 

[AB — l)wi+Cwi — D*T^p]mi^ 

EFwi 

ABwi — jj, 

EFwi 

EFwi-\-[{m—l)B-\-{j3—l)]wi — {m — l)D:i‘T[i3]/^ij 
mEFwi 

Fwj-Tip] 

mFwi 



V z e [/?] , l = i 

V z € [/?], I G [f3]/ {i} 

VzG[/3], ^g[/3+1, m] (11) 

V i G [/3+l,zz],?S [/3] 

V z e [/3 + 1, n], ^ e [/? + 1 , to] 



Proof. As in the previous theorems, the system (4) has the values of (11) as 
solutions. Now, for the necessary condition 



(0 < Pi <iy iG [/?], I G [!3]/{i} 
[0 < Pi < ly i g [i 3+ l,n], I g[P] 



must hold, that is 



( 



0 < 
0 < 



{AB — l)wi-\-Cwi — D^T^0^/^ij'^ ^ . 

EFwi ^ ^ 

EFwi-\-[{m-l)B-\-{i3-l)]wi-{m-l)D^T[/3]/fi^ 



mEFwi 



y i G [/3], I G [/?]/{*} 

< 1 V z G [/3+ l,zz], I G[P] 



The sufficient condition is trivial. In fact, it is easy to see that if relations (10) 
are verified, then 0 < P/ < 1 for all z G [n] and I G [to]/{z} and, therefore, the 
P/ are a set of strategies. From this, as the Pi are solutions of the system (4), 
they are a Nash equilibrium. □ 



Removing some constraints we can get the following theorem. 



Theorem 6. Consider a Route(N, P, S) problem such that S is an (/?, P)-set of 
supports with m > (3 and n > m > 2 and F = S, and a Route(A^, H, S) such 
that F y. H . If the solution to the Route(fV, P, S') exists and 



^[/3] ^ m/3-6Tnn-3+4Tn-mn/3-2n^ + 5n-/3 + 2mn^-n/3 j. 

mnl3-4m‘^ + 2m.-3n + 3l3-l3‘^ + mn'^ -3mn + 2ml3 + nl3-m‘^n‘^+4m‘^n ^ [/ 3 ]q 



(12) 



then also the solution to the Route(N, H, S) exists. The Nash probabilities for 
the Route(N, P, S) are those given in (11). 



Proof. If a solution to the Route(N, F,S) exists, then the relation (10) hold and, 
obviously, P = {P/} is a set of strategies. From this, it remains only to prove 
that this set of strategies gives a Nash equilibrium. Since for all the agents z G [n] 
such that P* = 0 or z G [/3 + 1, zz], P/ > 0 for all j G [to] such that P* = 1 let 
us concentrate on the agents z such that F) = 0 and P* = 1. For these agents it 
results c\ — c\ = (mnj3 — Amf + 2m — 3zz + 3/3 — /3^ + mn^ — 3mn + 2mj3 + n(3 — 
TO^n^4TO^n)zCi + (— TO/3+6TOn+3— 4TO+TOn/3+2n^— 5n+/3— 2z7zn^— n/3)T[^]/{i} < 
0 as the (12) holds, therefore the set of strategies P gives a Nash equilibrium for 
the Route(N, H, S) , which completes the proof of theorem. □ 
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4 Conclusions 

The framework studied in this paper can also be used to handle centralized 
networks in which a coordinator wishes to force the system to a particular equi- 
librium point. For this reason, he fix a particular set of supports (compatible with 
a set of constraints) to decide which links can be used by each agent. Once the 
system has reached the equilibrium, he could then also remove the constraints, 
thus freeing the links and leaving the system still in equilibrium. 
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Abstract. The mobile agent rendezvous problem consists of fc > 2 mo- 
bile agents trying to rendezvous or meet in a minimum amount of time 
on an n node ring network. Tokens and markers have been used suc- 
cessfully to achieve rendezvous when the problem is symmetric, e.g., the 
network is an anonymous ring and the mobile agents are identical and 
run the same deterministic algorithm. In this paper, we explore how 
token failure affects the time required for mobile agent rendezvous un- 
der symmetric conditions with different types of knowledge. Our results 
suggest that knowledge of n is better than knowledge of k in terms of 
achieving rendezvous as quickly as possible in the faulty token setting. 



1 Introduction 

1.1 The Problem 

The general rendezvous search problem includes many situations that arise in 
the real world, e.g., searching for or regrouping animals, people, equipment, 
and vehicles. It has been extensively studied (see [1] for a detailed review), with 
operations research specialists producing the bulk of this research. In these inves- 
tigations, the search domain is usually assumed to be continuous, and symmetry 
occurs largely because locations in the search domain are indistinguishable. The 
searchers break the symmetry by running the same randomized algorithm or 
by running different deterministic algorithms. In the latter case, for example, 
one searcher remains stationary while another searches the domain. Even when 
the searchers in a general rendezvous problem marked their starting points, as 
prescribed by Baston and Gal [3], the case where searchers ran the same deter- 
ministic algorithm was largely unexplored. 

Computer scientists have been interested in the rendezvous search problem 
in the context of mobile agents and networks. After being dispersed in a network, 
mobile agents may need to gather or rendezvous in order to share information, 
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receive new instructions, or download new programs that extend the mobile 
agents’ capabilities. The mobile agent rendezvous problem differs from the gen- 
eral rendezvous problem in significant ways. Network topologies are discrete and 
include highly symmetric structures like rings, tori, and hypercubes. In addition, 
memory and time requirements are important in computer science problems but 
in the general rendezvous problem research the memory requirements are usually 
ignored. 

In the mobile agent rendezvous problem, symmetry can occur at several lev- 
els (topological structure, nodes, agents, algorithm), each playing a key role in 
difficulty of the problem and of its resolution. The standard symmetric situation 
is indeed a ring network of n identical nodes in which k identical agents running 
the same deterministic protocol must rendezvous. 

If no other computational power is given to the agents, rendezvous becomes 
impossible to achieve by deterministic means (eg. see [9]), and thus the problem 
is in general deterministically unsolvable. 

The research has thus focused on what additional powers the agents need to 
perform the task. The aim is to identify the “weakest” possible condition. Some 
investigators have considered empowering the agents with unique identifiers; 
this is for example the case of of Yu and Yung [9], and of Dessmark, Fraigniaud, 
and Pelc [4] who also assume unbounded memory; note that having different 
identities allows each agent to execute different algorithms. Other researchers 
have empowered the agents with the ability to leave notes in each node they 
travel, i.e., read/write capacity; this is the model of Barriere et al [2], and of 
Dobrev et al [5]. 

In the third approach, originally developed by Baston and Gal [3] for the 
general rendezvous problem, the agents are given something much less powerful 
than distinct identities or than the ability to write in every node. Specifically, 
each agent has available a marker, called token, placed in its home base, visible 
to any agent passing by that node; the tokens are however all identical. Since the 
tokens are stationary, the original intertoken distances are preserved while the 
agents move and, with enough memory, could be used to break the symmetry 
of the setting. This is the approach taken by Kranakis et al [6] and Flocchini et 
al [7], and is the one we follow in this paper. 

In the case of fc = 2 mobile agents, Kranakis et al [6] proved that solving the 
rendezvous problem in an n node anonymous ring requires 0(loglogn) memory; 
several algorithms were presented to demonstrate that when more memory is 
available, rendezvous time can be reduced. 

Flocchini et al [7] studied the use of tokens to break symmetry when k > 2 
and proved that when the agents run the same deterministic algorithm, ren- 
dezvous is impossible if the sequence of the original intertoken distances is pe- 
riodic. They also presented several algorithms, with various memory and time 
requirements, that solve the mobile agent rendezvous problem for two or more 
mobile agents. Finally, they proved that a solution to the leader election in the 
k > 2 mobile agent case implies a solution to the mobile agent rendezvous prob- 
lem, but the reverse is not true. 
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The tokens used in the research of Kranakis et al [6] and Flocchini et al [7] 
were not subject to failure. Once placed on a node, a token was always visible 
to any mobile agent on the same node. 

In this paper, we are concerned with fault tolerance; we investigate how the 
mobile agent rendezvous problem changes when tokens can fail. Such failures 
may be the result of hostile nodes or malicious mobile agents and may occur 
either as soon as the token is placed by the mobile agent or at any time later. 

1.2 Our Contribution 

In this paper, we study the rendezvous of identical agents in an anonymous ring 
when tokens may fail. We are concerned with the failure of a token which will 
make it no longer visible by the agents, and investigate how time required to 
rendezvous changes under symmetric conditions with different knowledge avail- 
able to the agents, in particular, knowledge of the number of agents and the size 
of the ring. 

We consider two types of failures. We consider first the case when tokens can 
fail and thus disappear upon their release. We then consider the more complex 
situations arising when the tokens can fail anytime. 

Some facts can be determined quite easily. If both the number k of agents 
and the size n of the ring are unknown, the problem is unsolvable [7] . If all tokens 
fail, clearly no rendezvous is possible. In this paper we assume that either n or k 
is known to all agents, that the number / of failed token is f < k. Furthermore, 
we assume that gcd{k',n) = 1 for all k' < k. Under these conditions we show 
that, rendezvous is possible regardless of the number of failures and of when they 
occur. It follows from [7] that if this condition does not hold, there will always 
be initial conditions under which rendezvous is not possible. 

The solutions and their time complexity are clearly different depending on 
the types of faults, as well as on whether the agents know the size n of the ring 
or their number k. The time required by our solution protocols is summarized 
in Table 1. When tokens are reliable and never fail, the existing algorithm [7] 
ensures rendezvous in 0{n) time when either A: or n is known. We note that all 
of these algorithms require O(fclogn) memory per mobile agent. 



Table 1. The Cost of Token Failure 



Tokens Fail 


Knowledge 


Time 


Algorithm 


never 


n or k 


0(n) 


[7] 


upon release 


n 


0(n) 


Meetl 




k 


0{k n) 


Meet2 


anytime 


n 


0{k n) 


Meets 




k 


0(k^ n) 


Meets 



If the tokens fail upon release, knowledge of n allows the agents to over- 
come all the failures; in fact, rendezvous takes place within 0{n) time units. 
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If the agents know k, we propose an algorithm that uses 0(kn) time units to 
rendezvous. Interestingly, this bound holds also if the ring is asynchronous. 

If tokens can fail anytime, but the mobile agents know n, we show that the 
time required for rendezvous only increases by a factor of k over that required 
when no tokens fail. On the other hand, if the agents know only k, we describe an 
algorithm that uses 0{k^n) time units to rendezvous. Both of these algorithms 
work in the synchronous setting. 

For both types of failures, our results suggest that knowledge of n yields a 
time saving over knowledge of just k. We note that in the fault-free environment 
this does not hold, in that it is always possible to compute n from k, or vice 
versa, in 0(n) time. 

2 Definitions and Basic Properties 

We consider a collection of fc > 2 identical mobile agents scattered on a syn- 
chronous ring of n identical nodes. The ring is unoriented, i.e., there is no glob- 
ally consistent indication of “left”, and the agents have no globally consistent 
notion of “clockwise” . Each mobile agent owns a single, identical, stationary to- 
ken that is comprised of one bit. A given node requires only enough memory to 
host a token and, at most, k mobile agents. 

Each agent is initially in a different node (its home base) whose location is 
unknown to the others. The agents follow the same deterministic algorithm and 
begin execution at the same time. The task for the agents is to rendezvous, i.e., 
to meet in the same node in finite time; the goal is to do so in as little time as 
possible. The problem is to design an algorithm, the same for all agents, that 
will allow them to perform the task. An agent releases its token in the first step 
of any rendezvous algorithm; when an agent transits on a node with a token, it 
can see it. Moreover, agents can see each other when they are in the same node. 

We call intertoken distances the sequence of distances between the fault-free 
tokens, once they have been released. Since the tokens are stationary, the original 
intertoken distances are maintained unless a token fails. 

When a token fails it is no longer visible to any agent passing by the node. 
We will consider two types of failures: when tokens can fail and thus disappear 
upon their release, and when the tokens can fail anytime. Let / be the number 
of tokens that fail. 

Some limiting facts can be determined quite easily. 

Fact 1 The problem is unsolvable: 

1. If both the number k of agents and the size n of the ring are unknown. 

2. If all tokens fail; i.e., f = k. 

Proof. Part (1) is due to [7]. Part (2) is well-known. □ 

We assume that the agents have knowledge of either k or n and that the 
number / of failed tokens is less than k. We also assume that that gcd(fc',n) = 
1 for all k' < k. It follows from [7] that if this condition does not hold, there will 
exist initial configurations under which rendezvous will not be possible. 
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3 Rendezvous When Tokens Fail Upon Release 

First, we assume a token can fail only upon release, i.e., in the first step of a 
given algorithm. The agent that released the token is unaware that it failed. If 
neither n, the size of the ring, nor k, the number of agents, are known to the 
agent the problem is unsolvable [7]; thus, in the following we will consider two 
cases: when only n is known and when only k is known. 

3.1 n Known, k Unknown 

The algorithm is rather simple. Each agent releases its token, and moves along 
the ring computing the distances between the successive tokens it encounters, 
and returns to its homebase. Let s\,. . . ,su be the sequence it computed, where 
Si and Sh are the distances between the homebase and the first encountered 
token, and between the last token and the homebase, respectively. If its own 
token did not fail, h = k — f and this sequence is exactly the (circular) sequence 
S = di, . . . ,dh of the intertoken distances of the tokens that did not fail. If 
however the agent finds that its token has failed, then h = k — f + 1 and the 
agent must adjust the sequence to determine the actual (circular) sequence of 
intertoken distances: S = d \, . . . , d^-i = Si + Sh, S 2 , . . . , Sh-i- 

Since the failures happen before the agents start to move around, the circular 
sequence computed in this way is the same for all agents within a rotation (due 
to the fact that the ring is not oriented). 

The agents will then agree on a meeting node by exploiting the asymmetry 
of the sequence, which cannot be periodic since gcd{k',n) yf 1 for all k' < k. 

Consider two sequences of intertoken distances, A = and B = 

{ji, . . . , jVi}, in a given ring with n nodes. The two sequences are lexicographi- 
cally equal if im = jm for to, where 1 < to < n. Sequence A is lexicographically 
greater than B \i im > jm for some to where 1 < to < n and for all q < m, 
iq = jq. Let Ra denote the rotations of the sequence A. The lexicographically 
maximum rotation of A is the sequence La in Ra such that no sequence in Ra 
is lexicographically larger than LA- 

Each mobile agent calculates a sequence of intertoken distances S. Let 
denote the reverse of S. Each mobile agent will compute the lexicographically 
maximum sequence in both S and S^. If both strings start at the same node, 
that will become the meeting point. Otherwise, the meeting point will be the 
midpoint in the odd path between the starting nodes of the two strings. 

The routine to identify the meeting point, which will be used in all our 
algorithms, is described in Table 2, where I exi{some Sequence) denotes the lex- 
icographically maximum rotation of someSequence. The overall algorithm is 
described in Table 3. The proof of the following theorem is straightforward. 

Theorem 1. When the agents have 0{k\ogn) memory, n is known, gcd{k' ,n) 
= 1 holds for all k' < k, f < {k — 1) tokens fail, and tokens can fail only upon 
release, then the mobile agent rendezvous problem can be solved in less than 2n 
time units. 
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Table 2. Calculation of the meeting point 



Meetingpoint (string) 

1. Compute forward = lexi(string) and reverse = lexi(string^) . 

2. Let X and y denote the nodes at the beginning of forward and reverse respectively. 

3. If a; = y, then x is the rendezvous point. 

4. If a: 7 ^ y, then the rendezvous point is the midpoint of the odd path 
between x and y (such a path must exist because n is odd). 



Table 3. Failure at the beginning, n known 



Let m = k — f he the number of remaining tokens. 

Algorithm MEETl 

1. Release the token at the starting node. 

2. Choose a direction and walk once around the ring. 

3. Compute the sequence of intertoken distances si, . . . ,Sh ■ 

If a token is present at this node: S = di, , dm = si , . . . , sji . 

If no token is present at this node: S — di, ..., dm = si + Sh, S 2 , ... , Sh-i . 

4. Move to Meetingpoint{S). 



Notice that, for this algorithm to work, the system does not need to be 
synchronous; in fact, each agent could independently make its calculations and 
move to the rendezvous point even if the ring is asynchronous. 

3.2 k Known, n Unknown 

More interesting is the case when n is unknown. As in the previous case, the 
agents will walk around the ring long enough to be able to calculate a sequence 
of intertoken distances; this sequence must be such that it covers the entire ring 
and contains an asymmetry that can be exploited by the agents to agree on a 
meeting point. 

Unlike the previous case, however, since n is not known, the circular sequences 
computed by the agents using solely knowledge of k, are not necessarily identical. 
In fact, each agent, to construct its sequence, will fully traverse the ring the same 
(unknown) number of times but, depending on the starting point and on the 
number of failures, it will also traverse a portion of the ring which is in general 
different for different agents. 

Although the sequences are not the same, we will show that the agents can 
still identify an unique point where to meet by calculating 3fc intertoken dis- 
tances. The Algorithm Meet2 is described in Table 4. 

Let denote the reverse of S. Since the tokens fail only upon release, 
can be partitioned as follows: 



S'^ = Q‘^ + d^,.--,di 



( 1 ) 
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where Q'^ is the concatenation of q copies of a unique aperiodic subsequence 
Q, + is the concatenation operator, and dj,. . . ,di is a subsequence such that 
7 < |Q|. Upon identifying the subsequence Q, the agents can identify a unique 
node upon which to rendezvous. 

Table 4. Failure at the beginning, k known 



Algorithm MEET2 

1. Release the token at the starting node. 

2. Choose a direction and start walking. 

3. Compute the sequence of 3fc intertoken distances i.e., S = di,d 2 , ■ ■ ■ , d^k- 

4. Let be the reverse of S. 

5. Find the shortest aperiodic subsequence Q that starts with the hrst element of 
and is repeated such that — Q'^ + dy, . . . ,di where 7 < \Q\. 

6. Move to M eetingpoint{Q) 



Theorem 2. When the agents have 0{klogn) memory, k is known, gcd{k' ,n) 
= 1 holds for all k' < k, f < {k — 1) tokens fail, and tokens can fail only upon 
release, then the mobile agent rendezvous problem can be solved in 0{kn) time. 

Proof. Let m = k— f be the number of tokens that do not fail. Let A = 6\, . . . ,5m 
be the sequence of the m intertoken distances that exist after the / tokens have 
failed; clearly ~ be the sequence of 3fc intertoken distances 

calculated by a given agent a in step 3 of Algorithm MEET2. Let S^{a) be the 
reverse of S{a) and be the reverse of A. For all agents, the 3k intertoken 
distances are of the form 

5«(a) = {A^r + d^,...,di = {6m,.--,SiY + d-„...,di. (2) 

where {A^)p is the concatenation of p copies of the aperiodic subsequence A^, + 
is the concatenation operator, and dj, ... ,d\ is a subsequence such that 7 < m. 
Thus, there exists at least one aperiodic subsequence, namely A^, that satisfies 
equation 1. Note that A^ is aperiodic as it has k or fewer intertoken distances, 
and by assumption gcd(fc',n) = 1 for all k' <k. 

If A^ is the shortest subsequence that satisfies equation 1, the agents discover 

J^R 

in step 5 of Algorithm MEET2. Otherwise, the agents discover a shorter 
aperiodic subsequence, Q, that satisfies equation 1. 

The subsequence discovered in step 5 of Algorithm MEET2 is unique. If the 
shortest subsequence has z elements, these elements are the first z elements of 
S^. Any other subsequence of the same length that satisfies equation 1 is also 
comprised of the first z elements of and thus the subsequence discovered in 
step 5 is unique. This implies that all the agents identify the same rendezvous 
node in the remaining steps of Algorithm MEET2 and rendezvous occurs. 

Calculating S, the sequence of 3k intertoken distances requires 0{k\ogn) 
memory and requires 0{kn) time. Identifying the appropriate subsequence in 
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step 5, determining the rendezvous node, and walking to the rendezvous node 
takes 0{kn) time, so the overall time requirement is 0{kn). This completes the 
proof of Theorem 2. □ 

When the tokens only fail upon release. Algorithm MEET2 also solves the 
mobile agent rendezvous problem when the ring is asynchronous. Once an agent 
has calculated S^, it can identify the smallest aperiodic sequence that satisfies 
equation 1. Since this sequence is unique, the agent can then identify the unique 
rendezvous node, walk there, and wait until all k agent arrive. The algorithm 
does not depend on timing but rather on the agents’ ability to count and then 
identify the rendezvous node. 



4 Rendezvous When Tokens Can Fail at Any Time 

We now consider the situation when token failures can occur at any time. In this 
case, for the algorithms to work the system must be synchronous. 

The idea is still to make use of the intertoken distances to agree on a meeting 
point. In this case, however, the problem is made much more complicated by 
the fact that these distances vary unpredictably during the algorithm. To cope 
with that, our algorithm works in rounds, in each round, the agents compute 
some intertoken distances and try to meet in a node. In a round, however, the 
rendezvous may fail because the agents might have computed different intertoken 
distances. In such a case, only groups of agents (the ones that have computed 
the same intertoken distances) might meet (in a “false” rendezvous point). 

To make the algorithms work, the rounds must be synchronized so that the 
agents start them within some bounded time interval (if not simultaneously) . In 
other words, an agent arriving at its meeting point will have to know how long 
to wait before declaring that point a false rendezvous point and correctly start 
the next round, or before realizing that such a point is a true meeting point. 



4.1 n Known k Unknown 

First of all notice that since k is unknown, a group of agents finding themselves 
on the same node cannot determine whether rendezvous has been accomplished 
simply by counting how many they are. A different strategy will have to be used. 

A round is composed of three distinct steps. In stepl an agent travels around 
the ring to compute the intertoken distances; it then identifies the lexicographical 
maximum string. In step2 the agent moves to the computed meeting point (let 
us assume that it takes t time units to go there), and waits n — t units to 
synchronize with the others for the next step. Notice that, in general, the agent 
cannot understand at this point if rendezvous is accomplished just by counting 
the other agents on the meeting node. As we will prove, if this is the third time 
that the same string has been calculated, the agent can be sure that this is a 
true meeting point. In this case, in fact, the agent knows that everybody else 
has seen this string at least in the previous and in the current rounds and is now 
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in its own meeting point. StepS is used for notifying all agents that indeed the 
meeting point is the correct one. The agents who know, go around the ring. The 
agents who do not know, wait for n units: if nothing happens, they go to the 
next round; if a notifying agent arrives, they terminate the algorithm. 

The Algorithm is described in Table 5. 



Table 5. Failure at any time, n known 



Algorithm MEETS 

1. Release token. 

2. Set S'! = 52 = Ss = 0 

3. Set r = 0, choose a direction and begin walking. 

4. Travel for n time units computing the intertoken distances S = (si, , Sh). 

If a token is present at the last node: S = di, ... , dm = si, . . . , Sh . 

If no token is present at the last node: S — di, . . . , dm = si + Sh, . . . , Sh-i . 

5. Set t = 0. 

6. Walk to Meetingpoint{S) and increment t for each node traveled. 

7. Wait n — t clock ticks. 

8. Set Si = S 2 -, S 2 = 5s; S 3 = S. 

9. If Si = S 2 = S 3 

10. become(notifying), go around the ring, and then terminate 

11. Else wait n units 

12. If, while waiting, a notifying agent arrived, terminate. 

/* Rendezvous has occurred. */ 

13. Else, after waiting, set r — r + 1 and repeat from step 3. 



Lemma 1. When an agent sees the same string for the third time, all the other 
agents have seen it at least twice (in the previous and in the current round). 

Theorem 3. When the agents have 0{k\ogn) memory, know n, gcd{k',n) = 1 
holds for all k' <k, f < {k — 1) tokens fail, and tokens can fail upon release or 
later, then the mobile agent rendezvous problem can be solved in 0{kn) time. 

Proof. By lemma 1 we know that when an agent sees the same string for the 
third time the meeting point was the true rendezvous point. All the agents met at 
that point and after n time units everybody will terminate. In each round, every 
agent has spent n time units to compute the intertoken distances, n additional 
time units to move to the meeting point and wait for the restart, n time units for 
the termination check. Thus, all agents are perfectly synchronized and start each 
round simultaneously. After, at most k + 2 rounds rendezvous is accomplished 
and the total time is then 0{kn). □ 



4.2 k Known n Unknown 

Lack of knowledge of n makes the situation more complicated; the main problem 
is to achieve synchronization. 
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In the following algorithm, if more than one but fewer than k agents meet on 
a given node, they merge and act as one agent for the remainder of the algorithm. 
Merged agents will be perceived by other agents with their multiplicity; thus, if 
an agent meets a group of merged agents, it will “see” how many they are. In 
this algorithm, before starting the next round, every group that has met in a 
false meeting point, merge and behave like one agent. 

Since the agents, not knowing n, cannot travel around the ring, they will 
travel trying to guess the ring size at each round. Initially they compute k 
intertoken distances and they guess n to be the sum of those distances (they use 
this guessed value for synchronization purposes); at each subsequent round they 
compute one less intertoken distance and they change their guess of n. 

The algorithm is designed in such a way that 1) it is guaranteed that agents 
are always in at most one round apart, 2) after at most k — 1 rounds rendezvous 
is accomplished. The Algorithm is described in Table 6. 



Table 6. Failure at any time, k known 



Algorithm MEET4 

1. Release token. 

2. Set r = 0, where r denotes a round of the algorithm. 

3. Choose a direction and begin walking. 

4. Upon meeting another agent, merge with it. 

5. Calculate the first k — r intertoken distances, i.e., S = (di, . . . , dfc-r)- 

6. Estimate n as h = dj. 

7. If S is periodic, wait 2h steps, set r = r + 1, and repeat from step 3. 

8. Calculate Slmr, the lexicographically maximum rotation of S. 

9. Set t — 0. 

10. Walk to the node that starts Slmr and increment t for each node traveled. 

11. Wait 2h - t clock ticks. 

12. If there are k agents or their merged equivalent on the current node, stop. 
/* Rendezvous has occurred. */ 

13. Else if there are 1 < v < k agents on the current node, then merge. 

14. Set r = r + 1 and repeat from step 3. 



The following three lemmata are used in the proof of Theorem 4. Lemma 2 
proves that the agents are always less than a round apart and thus an agent 
need only wait ^ clock ticks for any agents that saw the same view. 

Lemma 2. Let a be the first agent to finish estimating n in round r, where 
Q < r < k—1, and let t denote the time at which agent a finishes the estimation. 
All other agents will either finish round r—1 or merge with a on or before time r. 

Lemma 3. Mobile agents that see the same sequence S, up to a rotation, of 
intertoken distances are in the same round. If S is aperiodic, then these agents 
will rendezvous at the end the given round. However, if S is periodic, then the 
agents cannot rendezvous during this round and must proceed to the next round. 
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Lemma 4. If at most f tokens have failed, then the agents will not execute 
round r = / + 1 in Algorithm MEET4- 

Theorem 4. When the agents have O(fclogn) memory, know k gcd{k',n) = 1 
holds for all k' < k, f < {k — 1) tokens fail, and tokens can fail upon release or 
later, then the mobile agent rendezvous problem can be solved in 0{k^n) time. 

Proof. Let / < A: — 1 be the number of tokens that actually fail. Lemma 4 implies 
that no agent will execute more than f+1 rounds of Algorithm MEET4. Suppose 
that rendezvous has not occurred by the end of round r = f. Let a* denote the 
first agent that begins round r = / + 1 and let tg denote the time that a* starts 
round r = / + 1. Since / tokens have failed, a*’s estimate for n in round r = 
/ + 1 will be correct, i.e., h = n. The remaining agents see the same aperiodic 
sequence, up to a rotation, of intertoken distances as a* and thus Lemma 3 
implies that rendezvous occurs at the end of round /. 

The number of tokens that fail, /, is at most fc — 1 so, as mentioned above, 
at most k rounds of Algorithm MEET4 are executed. A round takes at most 
kfn — 1) time, i.e., the product of the number of intertoken distances measured 
and the maximum intertoken distance possible. As a result, the time required is 
0{k^n). Because at most k intertoken distances are calculated and the maximum 
intertoken distance is n — 1, the memory required is O(fclogn). This completes 
the proof of Theorem 4. □ 

5 Conclusion 

The effect of token failure on the time required to rendezvous suggests that 
it would be interesting to explore other sources of failure in the mobile agent 
rendezvous problem. For example, what are the implications for rendezvous when 
mobile agents fail? Mobile agent failure could be partial, such as not merging 
when appropriate, or absolute, such as not operating at all. It would also be 
interesting to determine the impact of network problems, such as heavy traffic, 
on mobile agent rendezvous. 
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Abstract. We study here the gossiping problem (all-to-all communication) in 
known radio networks, i.e., when all nodes are aware of the network topology. 
We start our presentation with a deterministic algorithm for the gossiping problem 
that works in at most n units of time in any radio network of size n. This is an 
optimal algorithm in the sense that there exist radio network topologies, such 
as: a line, a star and a complete graph in which the radio gossiping cannot be 
completed in less then n units of time. Furthermore, we show that there isn’t 
any radio network topology in which the gossiping task can be solved in time 
< [log(n — 1)J +2. We show also that this lower bound can be matched from 
above for a fraction of all possible integer values of n; and for all other values of 
n we propose a solution admitting gossiping in time [log(n — 1)] + 2. Finally 
we study asymptotically optimal 0(D)-time gossiping (where D is a diameter 
of the network) in graphs with max-degree A = 0( ^ iog»„^ — )> integer 

constant i > 0 and D large enough. 



1 Introduction 

The two classical problems of information dissemination in computer networks are the 
broadcasting problem and the gossiping problem. In the broadcasting problem, we want 
to distribute a particular message from a distinguished source node to all other nodes 
in the network. In the gossiping problem, each node v in the network initially holds a 
message m„, and we wish to distribute all messages to all nodes in the network. In 
both problems, we very often like to minimise the time needed to complete the tasks. 

A radio network can be modelled as an undirected graph G = {V, E), where V 
represents the set of nodes of the network and a set E contains (unordered) pairs of 
vertices, s.t., a pair {v, w) G E, for some v,w G V iff nodes v and w can communicate 
directly. We say that all neighbours of a vertex v G V form the range of v. One of 
the radio network properties is that at any time step each processor is either in the 
transmitting mode or in the receiving mode. We also assume that all processors work 
synchronously, and if a processor u transmits a message m at time step t, the message 
reaches all nodes within its range at the same time step. However any of its neighbours 
V will receive message m successfully iff u is the only node that transmits to v (has v in 
its range) at time step t. Otherwise a collision occurs and the message m does not reach 
the node v. 

In this paper we focus on gossiping algorithms that use the entire information about 
the network topology. Such topology-wise communication algorithms are useful in 
radio networks that have relatively stable topology/infrastructure. And as long as no 
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changes occur in the network topology during the actual execution of the algorithm, the 
task of gossiping is completed successfully. Another interesting aspect of deterministic 
communication in known radio networks is its close relation to randomised communi- 
cation in unknown radio networks. 

1.1 Previous Work 

Most of the work devoted to radio networks is focused on the broadcasting problem. In 
the model with known radio network topology Gaber and Mansour [8] showed that the 
broadcasting task can be completed in time 0{D + log® n), where D is the diameter 
of the network. In other work of Diks et al. [7] we find efficient radio broadcasting al- 
gorithms for (various) particular types of network topologies. However in the general 
case, it is known that the computation of an optimal (radio) broadcast schedule for an 
arbitrary network is NP — hard, even if the underlying graph of connections is em- 
bedded in the plane [2, 12]. The gossiping problem was not studied in the context of 
known radio networks until very recent work of G^sieniec and Potapov [10]. In that 
paper one can find a study on the gossiping problem in known radio networks, where 
each node transmission is limited to unit messages. In this model they proposed sev- 
eral optimal and almost optimal 0(n) -time gossiping algorithms in various standard 
network topologies, including: lines, rings, stars and free trees. They also proved that 
there exists a radio network topology in which the gossiping (with unit messages) re- 
quires l7(nlogn) time. A similar work, in model with messages of logarithmic size, 
can be found in [1], where Bar- Yehuda et al. study randomised multiple point-to-point 
radio communication, and in [5], where Clementi et al. consider simultaneous execu- 
tion of multiple radio broadcast procedures. So far, the gossiping problem was mostly 
studied in the context of ad-hoc radio networks, where the topology of connections is 
unknown in advance. In this model, Chrobak et al. [4] proposed fully distributed deter- 
ministic algorithm that completes the gossiping task in time 0(v?l'^ log® n). For small 
values of diameter D, the gossiping time was later improved by Gqsieniec and Lingas 
[9] to 0(nZli/2 iog3n). Another interesting 0(n®/^)-time algorithm, a tuned version 
of the gossiping algorithm from [4] can be found in [14]. A study on deterministic 
gossiping in unknown radio networks with messages of limited size can be found in 
[3]. The gossiping problem in ad-hoc radio networks attracted also studies on efficient 
randomised algorithms. In [4], Chrobak et al. proposed a O(nlog^n) time gossiping 
procedure. This time was later reduced in [11] to 0{n log® n), and very recently in [6] 
to 0(nlog^ n). 

1.2 Our Results 

We start here with a proof that in any known radio network of size n, the gossiping task 
can be completed in at most n units of time. Please note here that we are interested in 
exact complexity (as oppose to the asymptotic complexity) since the design of an 0(n)- 
time gossiping procedure is rather trivial. Our new algorithm is optimal in the sense that 
there exist radio network topologies including: lines, stars, but also complete graphs, in 
which the gossiping task cannot be completed in time < n. Further, we show that there 
isn’t any radio network topology in which the gossiping task can be solved in time < 
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[log(n— 1)J +2. We also show that this bound can be matched from above for a fraction 
of all possible integer values of n; and for all other values of n we propose a solution 
admitting gossiping in time [log(n — 1)] + 2. The second part of the paper is devoted 
to efficient gossiping in arbitrary graphs (as oppose to the worst case) graphs. This 
work is done in relation to [8], where the authors proposed an asymptotically optimal 
0{D + log^ n)-time broadcasting procedure in known radio networks. For obvious 
reasons, the diameter is a natural lower bound for both the broadcasting and gossiping 
problems. In this paper we study a non-trivial class of graphs in which the gossiping can 
be done in time 0{D), i.e., in optimal asymptotic time. We first show that the gossiping 
can be performed in time {2D — 1)A + 1 in graphs with the maximal degree A. This 
result admits 0(Z7)-time gossiping in all graphs with D = 0(1). Later we show how to 
perform gossiping in time 0{D) in any graph with A = 0{ ^ ^ ), for any integer 

constant i > 0 and D large enough. 

2 The Gossiping Algorithm in Time < n 

Let the graph G = (V,E) he the underlying graph of connections in a given radio 
network, where n = \V\ stands for the size (number of nodes) of the network. We 
assume that initially each node v G V holds a unique message rriy. The gossiping task 
(all-to-all communication) is performed in discrete units of time, also called time steps. 
At any unit of time, a node can be in one of the two transmission modes: either in 
the receiving mode or in the transmitting mode. A gossiping algorithm is understood 
to be a well defined fransmission/reception procedure, for each node of the network. 
We assume here that during a single time step a transmitting node is allowed to send 
a combined message which includes all messages that it received so far. The running 
time of a gossiping algorithm is the smallest number of time steps in which the all-to-all 
communication is completed. 

We say that a radio network has a radius k if there exists at least one node in V 
which is at distance < k from all other nodes in the network, and there isn’t any node in 

V at distance < k from all other nodes in the network. We call this special node a central 
node and we denote it by c. Let be a gossip message originated in a node v G V. 
At any time step of the gossiping process My denotes the set of messages acquired by 

V until now. E.g., initially My = {rriy}. We also distinguish a set T of nodes that have 
not transmitted (at all) since the beginning of the gossiping process. Initially T =V.\n 
a radio network with a radius k and a central node c, we can partition the set of nodes 
T into disjoint subsets Nq,Ni, ..,Nk, such that the set Ni = {v : distc{v,c) = i}, 
for i = 0, 1 , 2 , ..., k, where distc{u, v) stands for a length of a shortest path between 
nodes u and v in G. In other words, Nq contains only the central node c, Ni contains all 
neighbours of the central node, etc. A reasoning presented here is based on the notion of 
a minimal covering set. The minimal (in terms of inclusion) covering set Gi is a subset 
of Ni, s.t., every node in is connected to some node in Gi, and a removal of any 
node from Ci destroy this property. 

Observation 1 Each node v G Gi is connected to some node u G Ai+i , such that u is 
not connected to any other node in Gi — {u}. 
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Fig. 1. Radio network with central node c and radius k 



2.1 The Outline of the Algorithm 

The general idea of the gossiping algorithm is as follows. Initially we point out that in 
any radio network with radius 1 and size n the gossiping task can be completed in time 
n. In radio networks with larger radii we show that there always exist four distinct nodes 
v,v' ,w and w' , s.t., v can transmit (its current content) to v' and w can transmit 
to w' in the same time step, and that the removal of both v and w does not disconnect 
the remaining part of the network. We call this approach a 2-vertex reduction. The 
gossiping algorithm uses the new approach for the purpose of moving all n messages to 
some connected subnetwork (of the original network) with radius 1 and size n — 2fc, in 
exactly k units of time. This is followed by the gossiping in the subnetwork in at most 
n — 2k units of time. Finally, we use the last k units of time to distribute the messages 
to all other nodes in the network by reversing the transmission direction in the initial 
k units of time. This means that the gossiping in any radio network of size n can be 
completed in time < n. 

2.2 Gossiping in Radio Networks with Radius 1 

We show here that in any radio network topology with radius 1 the gossiping algorithm 
can be completed in time n, where n is the size of the network. W.l.o.g., we assume that 
n > 2. According to the dehnition of a radio network with radius 1 , we know that there 
exists a central node c which is at distance < 1 from any other node in the network. In 
this case, we can transmit all messages rriv, s.t., v G V — {c} to the central node c, one 
by one, in time n — 1. When this stage is completed. Me = {niy : v G V}. And we 
need one more time step to disseminate the content of My to all other nodes in the radio 
network. 



2.3 Gossiping in Radio Networks with Higher Radii 

We show here an efficient reduction of a network with an arbitrary radius k to its con- 
nected subnetwork with radius 1. This reduction is based on the concept of 2-vertex 
reduction, introduced at the beginning of this section. 
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Reduction 2— >1. We prove here that any radio network with radius 2 can be efficiently 
reduced (by a sequence of 2- vertex reductions) to its subnetwork with radius 1 . Given a 
radio network with radius 2. Initially, we choose a central node c and we construct sets 
Nq, Ni, N2 and a covering set Ci. As long as |Ci | > 2, we can transmit successfully 
the contents of at least two nodes v,w £ N2 to some nodes v' ,w' £ Ci at each time 
step, thanks to the covering property of Ci . Please note that after each transmission 
nodes v, w are “removed” temporarily from the network and the content of the covering 
set Cl is recomputed. When eventually \C\\ < 2 two cases apply. When jCil = 0 the 
radius of the subnetwork becomes 1, and the reduction is completed. If jCil = 1, we 
call the node remaining in C\ an essential node ei. Let Yi be the set of nodes in A^i 
that are not neighbours of ei. If both |Li|, |A^ 2 | > 0, we can always chose v £Y\ and 
w £ N2, where u' = cand w' = ei. If eventually |A^ 2 | =0 also jCil = 0, the reduction 
is completed. Alternatively when |yi | = 0, the essential node ei becomes a new central 
node in a subnetwork with radius 1. 

Reduction k+1— >k. We prove here that any radio network with radius fc + 1 can be 
efficiently reduced (by a sequence of 2-vertex reductions) to its subnetwork with radius 
k, for any k > 2. 

In a radio network with radius A:-|- 1, we initially choose a central node c and we con- 
struct the layer sets Nq, Ni,N2, ■ ■ ■ , Nk, N^+i and the covering sets Ci, C2, C3, . . . , 
Ck-i,Ck- Let Ci be a set Ni — Ci, for alH = 1,2,3, ... ,k — l,k. Note, that as long as 
\Ck\ > 2, we can transmit successfully the contents of at least two nodes v,w £ N^+i 
to some nodes v' ,w' £ Ck during each time step, thanks to the covering property of 
Cfc. When eventually \Ck\ < 2 two cases apply. When \Ck\ = 0, the radius of the 
subnetwork becomes k, and the reduction is completed. If ICfej = 1, we call the node 
remaining in Ck an essential node Ck- By Yk we denote the set of nodes in Nk that are 
not neighbours of Cfc. If now |Ci| -f IC 2 I -f l^sl-l-, . . . , |C'fe_ 2 | -f \Ck-i\ + |Yfc| > 0 and 
|7Vfe_|_i I > 0 we match any transmission from a node v £ Nk+i to a node v' £ Ck with a 
transmission from a node w £ CiUC' 2 UC' 3 U, . . . , C'fe_ 2 UC'fe_iUyfc to its neighbour w' 
in the layer closer to the central node. This reduction process will terminate when even- 
tually either: the set Nk+i is empty, which means that the radius of the subnetwork has 
been reduced to k\ or at some point |Ci|-l-|G 2 | + |(53|+, • • ■ , |C'fe- 2 | + |C'fe-i| + |Lfc| = 0. 
But then we know that the essential node Ck is a neighbour of all nodes in the set 
Nk — {cfc}. In this case, the essential node Ck becomes a new central node since its 
distance from any other node in the network is < k. The following theorem follows: 

Theorem 1. The gossiping task can be solved in any radio network of size n in at most 
n units of time. 

3 Optimal Topology for Gossiping in Radio Networks 

In this section we present a radio network topology in which radio gossiping can be 
performed in time [log(n — 1)] + 2. We later present a simple argument that radio 
gossiping cannot be completed in time < [log(n — 1)J + 2. We conclude this section 
with presentation of more complex topology that allows to perform radio gossiping in 
time [log(n — 1)J + 2, for a fraction of all possible integer values of n. 
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3.1 Upper Bound 

A topology of “gossiping-friendly” radio network supports the following transmission 
strategy. Initially collect (gather) all gossip messages in one distinguished node c and 
then distribute the messages to all other nodes as quickly as its is possible (i.e., in a 
single time unit). In order to achieve this goal we select a node (center) c € V and 
we connect it with any other node in V. The remaining n — 1 vertices in V — {c} are 
organised in an optimal broadcasting tree (OBT) used in the matching model, see, e.g., 
[13], with a root r. 

We point out here, that an OBT of size m rooted in node r can also serve the pur- 
pose of gathering messages (in radio network model) in root r in the optimal time 
[log to] . This is done by reversing all transmissions in time. I.e., if during broadcasting 
(in matching model) node v transmits to node w in step i = 1, .., [log to ] , in gathering 
algorithm (in radio network) node w transmits all messages it collected so far in step 
[log to] — i + 1 . Using this observation we show that we can collect all messages from 
V — {c} in root r in time [log(n — 1)] — i + 1. These messages are later passed onto 
the central node c, who has all messages now. In the last step node c transmits a com- 
bined message (containing all messages) to all other nodes in one time step (since c is 
connected to every other node in the network). And the gossiping process is completed 
in time [log(n — 1)] + 2. 

Lemma 1. There exists a radio network topology in which the gossiping task can be 
completed in time [log(n — 1)] + 2, for any integer n. 

3.2 Lower Bound 

On the other hand, note that during each consecutive round knowledge (a number of 
possessed messages) in each node can at most double. This means that after step i 
knowledge of any node is limited to 2* original messages. Thus, after initial [log(n— 1)] 
steps of any gossiping algorithm in any radio network topology, non of the nodes is 
completely informed, since < n. Note also that during the last round of the 

gossiping process the only nodes that are permitted to transmit, are those who already 
possess all messages, since a transmitting node cannot receive messages at the same 
time. The following lemma follows. 

Lemma 2. The completion of the gossiping task in any radio network requires at least 
[log(n — 1)] -I- 2 steps. 

3.3 Tightening the Gap 

We have just presented both the upper and the lower bounds for the most suitable topol- 
ogy for radio gossiping. Please note that the upper bound and the lower bound meet 
each other when n = 2^ -f 1, for any integer k. For all other values of n the gap 
between the bounds is 1. This poses an interesting question, i.e., which of the two: 
[log(n — 1)] -f 2 or [log(n — 1)] -f 2 is the correct exact bound? In this section we 
show that the latter one is not. We propose more sophisticated radio network topology 
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in which, for large enough n, s.t., n < 2^ + — 0(2^), the gossiping can be done 

in time k + 2 = [log(n — 1)J + 2. 

Consider a network H which is composed of three components, see Figure 2: 

- a tree T\ with a root ri (and its two exact copies T2 and including adjacent 
edges), 

- a tree T 2 with a root h^, and 

- a group of special nodes: three roots ri,r 2 ,r 3 , three central nodes ci,C 2 ,C 3 , and 
three helpers hi,h2,h^- 




Fig. 2. The network topology for the optimal gossiping in time [log(n — 1)J +2 

The structure of trees T\ and T2 is very much based on the structure of the optimal 
broadcasting (in our case gathering) tree OBT(d) which is of size 2'^ and it contains 
d + 1 (time) levels. The nodes in Ti and T2 that transmit in time step 1 during the 
gathering process are understood to be at layer 1, those that transmit in time step 2 at 
layer 2, etc, see Figure 2. Thus, layers in the trees Ti and T2 are enumerated in reverse 
order, comparing to the numbering of layers in standard OBTs. The tree Ti is obtained 
from OBT(/c) via deletion of three sets of nodes Di,D2,Ds (to be defined later) and all 
edges connected to them. The tree T2 is obtained from OBT(fc — 3) via deletion of two 
sets of nodes D^, D5 and all edges connected to these nodes. The content of each Di, 
for i = 1, .., 5 is defined as follows: 

- Z7i is a set of leaves in Ti that are children of the nodes at layers (| + 1), .., fc — 1, 

- 172 is a set of nodes, from layers 1,2,3 and 4 in Ti that belong to subtrees rooted 
in children and grandchildren of the root ri , 

- D3 is a set of nodes of a subtree of T\ rooted in node di , where di is a child of the 
root ri at layer fc/2, 

- contains 3 nodes in T2'. two children x\ and X2 of the root at layer 1 and 2 
respectively, and the child of X2 at layer 1, and 

- Us is a set of nodes in T2 that form a subtree rooted in node d2 , where c ?2 is a child 
of the root at layer k/2. 
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Another important component of the network is a set of special nodes. This set 
includes, the root ri of T\ and its entirely equivalent copies r2 and rs, which are con- 
nected to the same nodes (as r\ is) in the tree Ti. On the top of this, the roots ri, V2 and 
T3 are mutually connected. The roots will be used to sent messages collected from the 
tree T\ to the central nodes Ci, and C3 in one time step (i.e., step k + 1). 

The set of special nodes includes also three central nodes ci, C2 and C3. The central 
nodes share direct connections to all other nodes in the network H. In fact, the direct 
connection from the central nodes form a partition of other nodes. It means that after 
we gather all messages in each of the nodes ci, C2 and C3, we are able to distribute the 
messages to all other nodes (i.e., complete the gossiping process) in a single time step. 
We show later how to inform all central nodes in at most k+l rounds. In particular, the 
center ci is connected to all nodes in Ti at layers 2, to the root ri, and the helper 
hi - The center C2 is connected to all nodes in T\ at layer 1 and layers (|- + 1 ), .., (/c — 1 ), 
the root r2, and the helper /12. The center C3 is connected to all nodes in T2 (including 
the helper h^), the child of the root in T\ at layer k, and the root r^. 

The last group of special nodes contains three helpers h\, h2 and /13, where /13 is 
the root of T2. They are mutually connected and their purpose is to exchange original 
messages from the central nodes and to acquire messages gathered in the tree T2. 

Once the construction of the network H is completed we show that gossiping in H 
can be performed in A: + 2 = \log{n — 1 )J + 2 rounds. The expression a ^ b,c, d, ... 
is used to denote that a node a sends its all current knowledge to nodes b,c,d, ... and 
kki||W2 means that transmissions Wi and W2 are performed simultaneously. 

Steps 

( 1 ) 

( 2 ) 

( 3 ) 

( 4 ) 

(|) 

(| + 1 ) 

(k-2) 

(k-1) 

(k) 

(k+l) 

(k+2) 

During the gossiping process (in the network H) we first collect all messages in 
central nodes ci, C2 and C3 in time k+l = [log(n — 1 )J + 1 . The main idea behind the 



Transmissions 

C2 ^ ^2 1 1 C3 ^ /13 1 1 (all nodes at layer 1 in Ti and T2 transmit) 

ri T2,r3 II ^2 ^ hs, hi || 

(all nodes at layer 2 in Ti transmit) 1 1 (all nodes at layer 1 in T2 transmit) 
T2 ^ +1 , +3 II (all nodes at layer 3 in Ti transmit) 1 1 
(all nodes at layer 2 in T2 transmit) 

T3 ^ ri , T2 1 1 (all nodes at layer 4 in Ti transmit) 1 1 
(all nodes at layer 3 in T2 transmit) 

Cl — > A-i 1 1 (all nodes at layer | in Ti transmit) 1 1 
(all nodes at layer | — 1 in T2 transmit) 
hi ^ h2, hs \ \ (all nodes at layer | + 1 in Ti transmit) 1 1 
(all nodes at layer | in T2 transmit) 

(all nodes at layer A: — 2 in Ti transmit) 1 1 
(all nodes at layer A: — 3 in T2 transmit) 

^3 ^ C3 1 1 A-3 ^ /i2 1 1 A13 — *■ ^1 1 1 (all nodes at layer A; — 1 in Ti transmit) 

^1 ^ Cl 1 1 A-2 ^ C2 1 1 (a node at layer k in Ti transmits to ri , T2 and rs) 

ri ^ Cl 1 1 T2 ^ C2 1 1 +3 ^ C3 

ci,C2,C3 transmit to all their neighbours. 
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removal of sets D\ through is to avoid collisions when the special nodes act, i.e., 
when they transmit and listen. The loss of nodes caused by removal of the sets Di,D 2 
and Z ?3 from Ti is compensated by the nodes available in the tree T 2 . In fact, the size 
of H formed of trees T\ and T 2 and a few more special nodes is 2^ + 2^“^ — 0(2^/^). 
This is due to the fact that the cardinality of each Di, for z = 1, 5 is 0(2^/^). The 
following lemma holds: 

Lemma 3. There exists a radio network topology in which the gossiping task can be 
completed in time [log(rz — 1)J + 2, for any integer n = 2^ + 2^“^ — 0(2^/^), and k 
large enough. 

In particular, we conclude that we know how to build the optimal (in terms of gos- 
siping) radio network topology for a fraction of all integer values of n. 

4 Gossiping in Time 0{D) 

We discuss here a class of graphs admitting radio gossiping in time 0{D). 

4.1 Gossiping in Time (2JD — 1).^ + 1 

The general idea of the algorithm is as follows. Initially, we chose a central node c 
and we partition all nodes into disjoint subsets, layers U, where 0 < i < D. This is 
followed by gathering stage when all (other n — 1) messages are moved to the central 
node c, layer by layer. Finally, a combined message (including all original messages) is 
distributed from c to all other nodes, also layer by layer. In what follows we show that 
all messages that reside at layer Ik can be moved to a neighbouring layer Ik-i (or Ik+i) 
in at most A units of time. 

Lemma 4. All messages available at layer Ik can be moved to a neighbouring layer 
Ik-i in at most A units of time, where \ < k < D. 

Proof. We use here notation introduced in Section 2. Let = Ik, and C^_i (subset of 
Ik-i) be the minimal covering set for Nj^. I.e., every node in is connected to some 
node in and removal of any node from C^_i destruct this property. Note, that 

every node v G C^_i is connected to some node u G N^, s.t., u is not connected to 
any other node in — {n}; otherwise we could remove v from C^_i. Thus, during 
a single time step, every node v G C^_i receives a message m„ transmitted from its 
unique node u G N^. Later, node u is removed from N^„ which means that a (virtual) 
degree of each node in C^_i is decreased by one. After removal of all us involved in the 
transmissions we end up with a new set N^, and its new covering set C C^_i- 

We repeat the whole process at most A times, since the degree of nodes in the covering 
set is decreased by 1 during each round of transmissions. 

This means that the gossiping task in any radio network with diameter D and max- 
imum degree A can be completed in time < {2D — 1)Z\ -I- 1, where DA comes from 
the gathering stage and 1 + {D — 1)Z\ from the broadcasting stage. 

Theorem 2. In any graph G, with a diameter D and a constant maximum degree, the 
gossiping task can be completed in time 0{D). 
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4.2 Gossiping in Graphs with Larger Max-Degree 

An algorithm presented in this section is based on a concept of efficient broadcasting 
0{D + log® n)-time procedure presented in [8]. We use here very similar partition of 
a (network) graph into clusters and super-levels. The main difference lies in greater 
complexity of the gossiping problem. 



Cluster Graph and Tree of Clusters. Assume we have a graph G = {V, E), where 
\V\ = n and a distinguished node s & V. Assume also that the diameter of G is at 
most D. A layer U in G is formed by nodes that are at (same) distance i from s, for 
i = 1 , .., D. All layers in G are grouped in x super-levels, s.t., the jth super-level is 
formed of layers I du~i) cj , for j = 1, .., x. Each super-level is covered by the 

X X 

set of clusters, s.t., (1) each cluster has diameter (2) the union of the clusters 

covers the super-level, and (3) the clusters graph can be coloured with 0(log n) colours, 
where the clusters graph is obtained by treating each cluster as a node, and introducing 
an edge between two nodes if in the original graph there is some edge that connects 
nodes from the corresponding clusters or if the clusters share a common node. Note, 
that the number of clusters does not exceed n; otherwise we would be able to remove 
at least one (redundant) of them. It also follows from construction presented in [8] that 
each cluster at super-level i has a direct connection (an edge in the cluster graph) with 
some clusters at super- levels i — \ and i + 1. This property allows to define a tree of 
clusters, which is a BPS tree rooted in a cluster that contain a distinguished node s. 
The broadcasting procedure proposed in [8] uses two types of information transfer in 
clusters, from the top layer through the bottom layer of a super-level, l.e., within each 
cluster we have either slow or fast transfers. The slow transfer is implemented by non- 
optimal broadcasting procedure, while the fast transfer is performed along a single path 
of length — . It is known, that transfers in the tree of clusters can be organised, s.t., on a 
path from any leaf to the root of the cluster tree there is at most 0(log n) clusters oper- 
ating slow transfers. In our gossiping algorithms the slow transfers are implemented by 
limited gossiping (defined below), and fast transfers (as in broadcasting) are performed 
along simple paths. 

In our algorithm we use 3 types of communication procedures: 

1. Limited gossiping - where each node distributes its (currently possessed) mes- 
sage to all nodes within some radius r. Note that if r = D, the limited gossiping 
coincides with the gossiping problem. Note also that slow transfers are based on 
limited gossiping; 

2. Between super-levels - where information residing at the top layer of a lower 
(further from the root cluster) super-level to the bottom layer of an upper super- 
level. This communication procedure is used when at least one cluster of the neigh- 
bouring super-levels is involved in slow transfers; 

3. Fast transfer - where information is moved across one cluster by fast pipelin- 
ing along a single (simple) path. 
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The gossiping algorithm is implemented in 3 stages. 

1. Initially messages in each cluster are collected in a distinguished node (possibly 

belonging to a fast route) in the top layer of each cluster. This is done by Limited 
GOSSIPING, where r = , i.e., maximal diameter of each cluster. Since the 

cluster graph can be coloured with 0(log n) colours, all limited gossipings per- 
formed simultaneously in each cluster (at all super-levels of the cluster tree) can be 
preformed simultaneously with at most log n-time slowdown. I.e., if T2(D) stands 
for the time complexity of limited gossiping in a graph with n nodes, max-degree 
Z\, and diameter D, the contribution of the first stage to the time complexity of our 
gossiping algorithm is ’^)- 

2. Messages from each cluster are delivered to the root cluster and in particular to the 
distinguished node s. During this stage the execution of three types of communi- 
cation procedures is performed in separate (interleaved) time steps. I.e., Limited 
GOSSIPING in time steps i = 0 (mod 3), Between super-level in time steps 
i = I (mod 3), and Fast transfer in time steps i = 2 (mod 3). Also, the 
execution of single rounds of communication procedures (within each type) is syn- 
chronised across all super-levels. E.g., the procedure Limited gossiping starts 
and ends exactly at the same time in each cluster and each super-level. Moreover, 
when new messages arrive at the bottom of a super-level (e.g., delivered by the Be- 
tween SUPER-LEVEL communication procedure) they are buffered at the bottom 
layer and allowed to traverse towards the upper layers only when the new round of 
Limited gossiping or Fast transfer is about to begin. 

A contribution to the time complexity of our gossiping algorithm of each the com- 
munication procedures is as follows: 

(a) Since executions of Limited GOSSIPINGs are synchronised across all super- 
levels, simultaneous execution of a single round of the Limited gossiping 
procedure is done in time T2 ( ■ And since each message, traversing 
towards the root cluster, experiences at most 0(log n) slow transfers (based on 
Limited gossiping) the total contribution to the slowdown of each message 
is bounded by 0(T]^( ^*°^" ) log n). 

(b) A single execution of one round of Between super-levels procedure can 
be implemented in time A. This is a consequence of Lemma 4. Since each 
message has to pass at most x borders between super-levels the contribution of 
this type of communication to the total time complexity is bounded by xA. 

(c) A simultaneous execution of Fast TRANSFER in potentially many clusters on 
the same super-level results in a need of pipelined transmission of messages 
according to the color of a cluster. I.e., during one round of a fast transfer, 
messages that traverse along a path in clusters coloured with number 1 start 
their journey immediately, in all clusters coloured with number 2 messages are 
released three time steps later (in order to avoid collisions between layers), 
in clusters coloured with number 3 - six time steps later, etc. Thus finally in 
clusters coloured with the largest number - 0(log n) time steps later. After 
message is released at the bottom layer it reaches the upper one in exactly 
time steps (a property of a fast transfer). Thus the contribution of this type of 
communication (across all super-levels) to the total time complexity is bounded 
by 0(a;(f -f logn)). 
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3. Eventually, after all messages are successfully gathered in the distinguished node s 
the combined message (containing all original messages) is broadcasted to all other 
nodes in the graph. This can be done by reversing the gathering process presented 
above, where the time complexity remains the same. 

Theorem 3. The time complexity of our gossiping procedure can be expressed by the 
recursive equation: T^{D) = logn + x{A + logn) + D), where x is 

the number of super-levels in the Cluster Graph. 

After iteration of the recursive equation from Theorem 3 i — 1 times, we obtain: 
T^{D)= 0{Tl{^^)\og^ n+x(Z\+logn)(log^-^+...+logn+l)+g( ‘°s^lT" + 

. . . + ” + 1 ) ) • After further substitution of the recursive component by the complexity 

{2D — \)A + 1 (see the gossiping algorithm presented in Section 4.1) and taking x = 
E)i+T log n, we get: 

Corollary 1. T^{D) = 0{D), for all graphs with A = 0{ ^ ^ ) and D = 
l7(log*~*’^ n), for all constant integers i > 0. 
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Abstract. Shareable data services providing consistency guarantees, such as 
atomicity (linearizability), make building distributed systems easier. However, 
combining linearizability with efficiency in practical algorithms is difficult. A re- 
configurable linearizable data service, called Rambo, was developed by Lynch 
and Shvartsman. This service guarantees consistency under dynamic conditions 
involving asynchrony, message loss, node crashes, and new node arrivals. The 
specification of the original algorithm is given at an abstract level aimed at con- 
cise presentation and formal reasoning about correctness. The algorithm prop- 
agates information by means of gossip messages. If the service is in use for 
a long time, the size and the number of gossip messages may grow without 
bound. This paper presents a consistent data service for long-lived objects that 
improves on Rambo in two ways: it includes an incremental communication pro- 
tocol and a leave service. The new protocol takes advantage of the local knowl- 
edge, and carefully manages the size of messages by removing redundant infor- 
mation, while the leave service allows the nodes to leave the system gracefully. 
The new algorithm is formally proved correct by forward simulation using lev- 
els of abstraction. An experimental implementation of the system was developed 
for networks-of-workstations. The paper also includes analytical and preliminary 
empirical results that illustrate the advantages of the new algorithm. 



1 Introduction 

This paper presents a practical algorithm implementing long-lived, survivahle, atomic 
read/write objects in dynamic networks, where participants may join, leave, or fail dur- 
ing the course of computation. The only way to ensure survivability of data is through 
redundancy: the data is replicated and maintained at several network locations. Replica- 
tion introduces the challenges of maintaining consistency among the replicas, and man- 
aging dynamic participation as the collections of network locations storing the replicas 
change due to arrivals, departures, and failures of nodes. 

A new approach to implementing atomic read/write objects for dynamic networks 
was developed by Lynch and Shvartsman [10] and extended by Gilbert et al. [6]. This 
memory service, called Rambo (Reconfigurable Atomic Memory for Basic Objects) 
maintains atomic (linearizable) readable/writable data in highly dynamic environments. 
In order to achieve availability in the presence of failures, the objects are replicated at 

* This work is supported in part by the NSF Grants 9984778, 9988304, 0121277, and 0311368. 

R. Kralovic and O. Sykora (Eds.): SIROCCO 2004, LNCS 3104, pp. 185-196, 2004. 
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several locations. In order to maintain consistency in the presence of small and transient 
changes, the algorithm uses configurations of locations, each of which consists of a set 
of members plus sets of read- and write-quorums. In order to accommodate larger and 
more permanent changes, the algorithm supports reconfiguration, hy which the set of 
members and the sets of quorums are modified. Obsolete configurations can be removed 
from the system without interfering with the ongoing read and write operations. The 
algorithm tolerates arbitrary patterns of asynchrony, node failures, and message loss. 
Atomicity is guaranteed in any execution of the algorithm [10, 6]. 

The original Rambo algorithm is formulated at an abstract level aimed at concise 
specification and formal reasoning about the algorithm’s correctness. Consequently the 
algorithm incorporates a simple communication protocol that maintain very little pro- 
tocol state. The algorithm propagates information among the participants by means of 
gossip messages that contain information corresponding to the sender’s state. The num- 
ber and the size of gossip message may in fact grow without bound. This renders the 
algorithm impractical for use in long-lived applications. 

The gossip messages in Rambo include the set of participants, and the size of these 
messages increases over time for two reasons. First, Rambo allows new participants to 
join the computation, but it does not allow the participants to leave gracefully. In order to 
leave the participants must pretend to crash. Given that in asynchronous systems failure 
detection is difficult, it may be impossible to distinguish departed nodes from the nodes 
that crash. Second, Rambo gossips information among the participants without regard 
for what may already be known at the destination. Thus a participant will repeatedly 
gossip substantial amount of information to others even if it did not learn anything new 
since the last time it gossiped. While such redundant gossip helps tolerating message 
loss, it substantially increases the communication burden. Given that the ultimate goal 
for this algorithm is to be used in long-lived applications, and in dynamic networks 
with unknown and possibly infinite universe of nodes, the algorithm must be carefully 
refined to substantially improve its communication efficiency. 

Contributions. The paper presents a new algorithm for reconfigurable atomic memory 
for dynamic networks. The algorithm, called LL-Rambo, makes implementing atomic 
survivable objects practical in long-lived systems by managing the knowledge accumu- 
lated by the participants and the size of the gossip messages. Each participating node 
maintains a more complicated protocol state and, with the help of additional local pro- 
cessing, this investment is traded for substantial reductions in the size and the number 
of gossip messages. Based on [6, 10], we use Input/Output Automata [11] to specify 
the algorithm, then prove it correct in two stages by forward simulation, using levels 
of abstraction. We include analytical and preliminary empirical results illustrating the 
advantages of the new algorithm. In more detail, our contributions are as follows. 

( 1 ) We develop L-Rambo that implements an atomic memory service and includes a 
leave service (Sect. 3). We prove correctness (safety) of L-Rambo by forward simula- 
tion of Rambo, hence we show that every trace of L-Rambo is a trace of Rambo. 

(2) We develop LL-Rambo by refining L-Rambo to implement incremental gossip 
(Sect. 4). We prove that LL-Rambo implements the atomic service by forward simula- 
tion of L-Rambo. This shows that every trace of LL-Rambo is a trace of L-Rambo, 
and thus a trace of Rambo . The proof involves subtle arguments relating the knowledge 
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extracted from the local state to the information that is not included in gossip messages. 
We present the proof in two steps for two reasons: (z) the presentation matches the in- 
tuition that the leave service and the incremental gossip are independent, and {ii) the 
resulting proof is simpler than a direct simulation of Rambo by LL-Rambo. 

(3) We show (Sect. 5) that LL-Rambo consumes smaller communication resources 
than Rambo, while preserving the same read and write operation latency, which under 
certain steady-state assumptions is at most Sd time, where d is the maximum message 
delay unknown to the algorithm. Under these assumptions, in runs with periodic gossip, 
LL-Rambo achieves substantial reductions in communication. 

( 4 ) We implemented all algorithms on a network-of-workstations. Preliminary empiri- 
cal results complement the analytical comparison of the two algorithms (Sect. 5). 
Background. Several approaches can be used to implement consistent data in (static) 
distributed systems. Many algorithms used collections of intersecting sets of object 
replicas to solve consistency problems, e.g., [2, 14, 15]. Extension with reconfigurable 
quorums have been explored [4], but this system has limited ability to support long- 
lived data when the longevity of processors is limited. Virtual synchrony [3], and group 
communication services (GCS) in general [1], can be used to implement consistent 
objects, e.g., by using a global totally ordered broadcast. The universe of nodes in a 
GCS can evolve, however forming a new view is indicated after a single failure and can 
take a substantial time, while reads and writes are delayed during view formation. 

The work on reconfigurable atomic memory [4, 10, 6] results in algorithms that are 
more dynamic because they place fewer restrictions on the choice of new configura- 
tions and allow for the universe of processors to evolve arbitrarily. However these ap- 
proaches are based on abstract communication protocols that are not suited for long- 
lived systems. Here we provide a long-lived solution by introducing graceful processor 
departures and incremental gossip. The idea of incrementally propagating information 
among participating nodes has been previously used in a variety of different settings, 
e.g., [7, 12]. Incremental gossip is also called anti-entropy [5, 13] or reconciliation [8]; 
these concepts are used in database replication algorithms, however due to the nature of 
the application they assume stronger assumptions, e.g., ordering of messages. 
Document structure. In Section 2 we review Rambo. In Section 3 we specify and 
prove correct the graceful leave service. Section 4 presents the ultimate system, with 
leave and incremental gossip, and proves it correct. Section 5 gives the analysis and 
experimental results. (Complete proofs and analysis are found in MIT/LCS/TR-943.) 

2 Reconfigurable Atomic Memory for Basic Objects (RAMBO) 

We now describe the Rambo algorithm as presented in [10], including the rapid config- 
uration upgrade as given in [6]. The algorithm is given for a single object (atomicity is 
preserved under composition, and multiple objects can be composed to yield a complete 
shared memory). For the detailed Input/Output Automata code see [10,6]. In order to 
achieve fault tolerance and availability, Rambo replicates objects at several network lo- 
cations. In order to maintain memory consistency in the presence of small and transient 
changes, the algorithm uses configurations, each of which consists of a set of members 
plus sets of read-quorums and write-quorums. The quorum intersection property re- 
quires that every read-quorum intersect every write-quorum. In order to accommodate 
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larger and more permanent changes, the algorithm supports reconfiguration, by which 
the set of members and the sets of quorums are modified. Any quorum configuration 
may be installed, and atomicity is preserved in all executions. 

The algorithm consists of three 
kinds of automata: (i) Joiner au- 
tomata, handling join requests, 

(ii) Recon automata, handling re- 
configuration requests and generat- 
ing a totally ordered sequence of 
configurations, and {in) Reader- 
Writer automata, handling read F‘g-1- Rambo architecture depicting automata at 
and write requests, manage config- * “d j, the channels, and the Recon service, 

uration upgrades, and implement 

gossip messaging. The overall systems is the composition of these automata with the 
automata modelling point-to-point communication channels, see Fig. 1 . The Joiner au- 
tomaton simply sends a join message when node z joins, and sends a join-ack message 
whenever a join message is received. The Recon automaton establishes a total ordering 
of configurations (for details see [10]). 

The external signature of the service is in Fig. 2. A client at node z uses join^ ac- 
tion to join the system. After receiving join-ackj, the client can issue read^ and writez 
requests, which result in read-ack^ and write-ack^ responses. The client can issue a 
reconi request a reconfiguration. The fail^ action models a crash at node z. 




Domains: /, a set of processes; V , a set of legal values; and C, a set of configurations, each consisting of members, read- 
and write-quorums 



Input: join(rambo, J)i, J Q I — {i}, i G I, 
such that if 2 = then J — ^ 
readi, i G I 
write(2?)i, V ^ V, i G I 

recon(c, c')i, c, c' ^ C, members(c), I 



Output: join-ack(rambo)i, 2 G I 

read-ack(2?)i, v ^ V, i G I 
write-ackj, i G I 

recon-ack( 6 )i, b G {ok, nok}, i G I 
report(c)i, c ^ C, i G I 



faih , 2 G I 



Fig. 2. Rambo: External signature. 

Every node of the system maintains a tag and a value for the data object. Every 
time a new value is written, it is assigned a unique tag, with ties broken by process- 
ids. These tags are used to determine an ordering of the write operations, and therefore 
determine the value that a read operation should return. Read and write operations has 
two phases, query and propagation, each accessing certain quorums of replicas. As- 
sume the operation is initiated at node z. First, in the query phase, node z contacts read 
quorums to determine the most recent known tag and value. Then, in the propagation 
phase, node z contacts write quorums. If the operation is a read operation, the second 
phase propagates the largest discovered tag and its associated value. If the operation is 
a write operation, node z chooses a new tag, strictly larger than every tag discovered 
in the query phase. Node z then propagates the new tag and the new value to a write 
quorum. Note that every operation accesses both read and write quorums. 

Configurations go through three stages: proposal, installation, and upgrade. First, 
a configuration is proposed by a recon event. Next, if the proposal is successful, the 
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Recon service achieves consensus on the new configuration, and notifies participants 
with decide events. When every non-failed member of the prior configuration has been 
notified, the configuration is installed. The configuration is upgraded when every con- 
figuration with a smaller index has been removed. Upgrades are performed by the 
configuration upgrade operations. Each upgrade operation requires two phases, a query 
phase and a propagate phase. The first phase contacts a read-quorum and a write- 
quorum from the old configurations, and the second phase contacts a write-quorum 
from the new configuration. All three operations, read, write, and configuration up- 
grade, are implemented using gossip messages. 

The cmap is a mapping from integer indices to configurations U{_L, ±}, initially 
mapping every integer to _L. It tracks which configurations are active, which have not 
yet been created, indicated by _L, and which have already been removed, indicated by 
±. The total ordering on configurations determined by Recon ensures that all nodes 
agree on which configuration is stored in each position in cmap. We define c{k) to be 
the configuration associated with index k. 

The record op is used to store information about the current phase of an ongoing 
read or write operation, while upg is used for information about an ongoing configura- 
tion upgrade operation. A node can process read and write operations concurrently with 
configuration upgrade operations. The op. cmap subfield records the configuration map 
associated with the operation. For read or write operations this consists of the node’s 
cmap when a phase begins, augmented by any new configurations discovered during 
the phase. A phase completes when the initiator has exchanged information with quo- 
rums from every valid configuration in op . cmap . The pnum subfield records the phase 
number when the phase begins, allowing the initiator to determine which responses cor- 
respond to the phase. The acc subfield records which nodes from which quorums have 
responded during the current phase. 

Finally, the nodes communicate via asynchronous unreliable point-to-point chan- 
nels. We denote by Channelij the channel from node i to node j. 



3 Rambo with Graceful Leave 

Here we augment Rambo with a leave service allowing the participants to depart grace- 
fully. We prove that the new algorithm, called L-Rambo, implements atomic memory. 

Nodes participating in Rambo communicate by means of gossip messages contain- 
ing the latest object value and bookkeeping information that includes the set of known 
participants. Rambo allows participants to fail or leave without warning. Since in asyn- 
chronous systems it is difficult or impossible to distinguished slow or departed nodes 
from crashed nodes, Rambo implements gossip to all known participants, regardless 
of their status. In highly dynamic systems this leads to (a) the size of gossip messages 
growing without bounds, and (b) the number of messages sent in each round of gossip 
increasing as new participants join the computation. 

L-Rambo allows graceful node departures by letting a node that wishes to leave 
the system to send notification messages to an arbitrary subset of known participants. 
When another node receives such notification, it marks the sender as departed, and 
stops gossiping to that node. The remaining nodes propagate the information about 
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Signature: 

As in RAMBO, plus new actions: 
Input : leave^, recv(leave)^. ^ 
Output : send(leave)^ • 



State: 

As in RAMBO, plus new states: 
leave-world, a finite subset of I, ini-| 
tially 0 

departed, a finite subset of I, initially 

Transitions at i: 

Input recv((iy, D,v,t, cm, pns, pnr ) ) 

Effect: 

if — 'failed A status ^ idle then 
status •«— active 
world •«— world U W 
departed •«— departed U D 



ig G /GMap, initially Vfc G I, 
ig{k) .wk — 0, ig{k) .w-ua — <l 
ig {k) .dk — 0, ig{k) . d-ua — 0, 
ig{k) .p-ack — 0 



Output send((Vy, D,v,t, cm, pns, pnr))ij 
Precondition: 

— 'failed 

status — active 
j G {world — departed) 

{W, D, V, t, cm, pns, pnr) = 



[h]hr-W {j , i, pnr) •«— W 
[h]hr-D{j , i, pnr) •«— D 
ig {j ) . wk •«— ig {j ) . wk U W 
ig{j) .w-ua •«— ig{j) .w-ua — W 
ig(j).dk^ig(j).dk U D 
ig{j) ■ d-ua •«— ig{j ) . d-ua — D 
if pnr > ig{j). p-ack then 
ig{j) .wk •«— ig{j) .wk U ig{j) .w-ua 
ig{j) .w-ua •«— world — ig{j).wk 
ig(j).dk^ig(j).dk U ig(j).d-ua 
ig{j) .d-ua •«— departed — ig{j).dk 
ig{j) .p-ack •«— pnuml 
if f > tag then {value, tag) •«— {v, t) 
cmap •«— update{cmap, cm) 
pnum2{j) •«— ma,x.{pnum2{j), pns) 
if op. phase G {query, propjApnr > op.pnum then 
op. cmap •«— extend{op.cmap, truncate{cm)) 
if op. cmap G Truncated then 
op.acc •«— op.acc U {j} 
else 

op.acc •«— 0 

op. cmap •«— truncate{cmap) 
if upg .phase G {query, prop} A pnr > upg .pnum then 
upg.acc •«— upg.accU {j} 

input recv(leave) j, j 
Effect: 

if ^failed A status — active then 
departed •«— departed U {j} 



{world — ig{j) .wk , departed —ig{j).dk 
value, tag, cmap, pnuml , pnum2 {j) 
Effect: 

pnuml •«— pnuml + 1 
[h]h-msg •«— h-msgU 

{{W, D,v,t, cm, pns, pnr) ,i,j) 



[h\hs-world{i, j , pns) •«— world 
[h]hs-departed{i, j , pns) •«— departed 
[h]hs-wk{i, j , pns) •«— ig{j).wk 
[h]hs-dk{i, j , pns) •«— ig{j).dk 
[h]hs-wua{i, j , pns) •«— ig{j) .w-ua 
[h]hs-dua{i, j , pns) •«— ig{j) .d-ua 
[h]hs-pack{i, j , pns) •«— ig{j) .p-ack 



input leave^ 

Effect: 

if —'failed then 
failed •«— true 

departed •«— departed U {f} 
leave-world •«— world — departed 

output send(leave)i,j 
Precondition: 

j G leave-world 
Effect: 

leave-world •«— leave-world — {j} 



Fig. 3. Modification of Reader-Wrileri for L-Rambo, and for LL-Rambo (the 



boxed 



code). 



the departed nodes to other participants, eventually eliminating gossip to nodes that 
departed gracefully. 



Specification o/L-Rambo. We interpret the fail^ event as synonymous with the leaver 
event - both are inputs from the environment and both result in node i stopping to 
participate in all operations. The difference between fail^ and leavei is strictly internal: 
leaver allows a node to leave gracefully. The well-formedness conditions of Rambo 
and the specifications of Joineri and Recon remain unchanged. The introduction of 
the leave service affects only the specification of the Reader-Writeri automata. These 
changes for L-Rambo are given in Fig. 3, except for the 



boxed 



segments of code that 

should be disregarded until the ultimate long-lived algorithm LL-Rambo is presented 
in Sect. 4 (we combine the two specifications in the interest of space). 

The signature of Reader-Writeri automaton is extended with actions recv(leave)^ ^ 
and send(leave)^ ^ used to communicate the graceful departure status. The state of 
Reader-Writeri is extended with new state variables: departedi, the set of nodes that left 
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the system, as known at node i, leave-worldi, the set of nodes that node i can inform of 
its own departure, once it decides to leave and sets leave-worldi to world — departed. 

The key algorithmic changes involve the actions recv(m)^ ^ and send(m)j^. The 
original Rambo algorithm gossips message m includes: W the world of the sender, 
V the object and its tag t, cm the cmap, pns the phase number of the sender, and pnr 
the phase number of the receiver that is known to the sender. The gossip message m in 
L-Rambo also includes D, a new parameter, equal to the departed set of the sender. 

We now detail the leave protocol. Assume that nodes i and j participate in the 
service, and node i wishes to depart following the leaver event, whose effects set the 
state variable failedi to true in Joinevi, Recoiii, and Reader-Writeri. The leavei action 
at Reader-Writeri (see Fig. 3) also initializes the set leave-worldi to the identihers 
found in worldi, less those found in departedi. Now Reader-Writeri is allowed to send 
one leave notification to any node in leave-worldi. This is done by the send(leave)i j 
action that arbitrarily chooses the destination j from leave-worldi. Note that node i 
may nondeterministically choose the original fail^ action, in which case no notihcation 
messages are sent (this is the “non-graceful” departure). 

When Reader-Writeri receives a leave notification from node j, it adds j to its 
departedi set. Node i sends gossip messages to all nodes in the set worldi — departedi, 
which including information about j’s departure. When Reader-Writeri receives a gos- 
sip message that includes the set D, it updates its departedi set accordingly. 

Atomicity of L-Rambo service. The L-Rambo system is the composition of all 
Reader-Writeri and Joineri automata, the Recon service, and Channekj automata for 
all i,j e /. We show atomicity of L-Rambo by forward simulation that proves that any 
trace of L-Rambo is also a trace of Rambo, and thus L-Rambo implements atomic 
objects. The proof uses history variables, annotated with the symbol [h] in Fig. 3. 

For each i we define h-mspi to be the history variable that keeps track of all 
messages sent by Reader-Writeri automata. Initially, h-mspi = 0 for all i G I. 
Whenever a message m is sent by i to some node j G I via Channelij, we let 
h-mspi ^ h-mspi U {(m, i,j)}. We define h-MSG tohe Uie/ h-mspi. (The remaining 
history variables are used in reasoning about LL-Rambo, see Sect. 4). 

The following lemma states that only good messages are sent. 

Lemma 1. In any execution of L-Rambo, if m is a message received by node i in 
a recv(m)jj- event, then G h-MSG, and m = {W, D,v,t,cm,pns,pnr) or 

m = leave or m = join. 

Next we show that L-Rambo implements Rambo, assuming the environment be- 
havior as (informally) described in Sect. 2. Showing well-formedness is straightforward 
by inspecting the code. The proof of atomicity is based on a forward simulation rela- 
tion [9] from L-Rambo to Rambo. 

Theorem 1. L-Rambo implements atomic read/write objects. 

4 Rambo with Graceful Leave and Incremental Gossip 

Now we present, and prove correct, our ultimate algorithm, called LL-Rambo (Long- 
Lived Rambo). The algorithm is obtained by incorporating incremental gossip in L- 
Rambo, so that the size of gossip messages is controlled by eliminating redundant in- 
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formation. In L-Rambo (resp. Rambo) the gossip messages contain sets correspond- 
ing to the sender’s world and departed (resp. world) state variables at the time of the 
sending (Fig. 3). As new nodes join the system and as participants leave the system, 
the cardinality of these sets grows without bound, rendering Rambo and L-Rambo 
impractical for implementing long-lived objects. The LL-Rambo algorithm addresses 
this issue. The challenge here is to ensure that only the certifiably redundant information 
is eliminated from the messages, while tolerating message loss and reordering. 
Specification o/LL-Rambo. We specify the algorithm by modifying the code of L- 
Rambo. In Fig. 3 the boxed segments of code specify these modihcations. The new 
gossip protocol allows node i to gossip the information in the sets worldi and departed i 
incrementally to each node j G worldi — departedi. Following j’s acknowledgment, 
node i never again includes this information in the gossip messages sent to j, but will 
include new information that i has learned since the last acknowledgment by j. 

To describe the incremental gossip in more detail we consider an exchange of a 
gossip messages between nodes i and j, where i is the sender and j is the receiver. The 
sets world and departed are managed independently and similarly, and we illustrate 
incremental gossip using just the set world. First we define new data types. Let an 
incremental gossip identifier be the tuple {wk, dk, w-ua, d-ua, p-ack), where wk, dk, 
w-ua, and d-ua are finite subsets of /, and p-ack is a natural number. Let IG denote 
the set of all incremental gossip identifiers. Finally, let IGMap be the set of incremental 
gossip maps, defined as the set of mappings I ^ IG. We extend the state of the Reader- 
Writeri automaton with ig^ G IGMap. Node i uses ig{j)i tuple to keep track of the 
knowledge it has about the information already in possession of, and currently being 
propagated to, node j (see Fig. 3). Specifically, for each j G worldi, ig{j)i-wk is the set 
of node identifiers that i is assured is a subset of world j, ig{j)i.w-ua is the set of node 
identifiers, a subset of worldi, that j needs to acknowledge. The components ig{j)i.dk 
and ig{j)i.d-ua are defined similarly for the departed set. Lastly, ig{j)i. p-ack is the 
phase number of i when the last acknowledgment from j was received. Initially each of 
these sets is empty, and p-ack is zero for each ig{j)i with j G I. 

Node j acknowledges a set of identifiers by including this set in the gossip message, 
or by sending a phase number of i such that node i can deduce that node j received 
this set of identihers in some previous message from i to j. Messages that include Fs 
phase number that is larger than ig {j ) i .p-ack are referred to as fresh or acknowledgment 
messages, otherwise they are referred to as late messages. (This is discussed later.) 

The lines annotated with [h] in Fig. 3 deal with history variables that are used only 
in the proof of correctness. 

In Rambo, once node i learns about node j, it can gossip to j at any time. We now 
examine the send{{W, D,v,t,cm,pns,pnr))ij action. The world component, W, is 
set to the difference of worldi and the information that i knows that j has, ig{j)i.wk, at 
the time of the send. Remaining components of the gossip message are the same as in 
L-Rambo. The effect of the send action causes phase number of the sender to increase; 
this ensures that each message sent is labeled with a unique phase number of the sender. 

Now we examine rec\/{{W, D,v,t,cm,pns,pnr))ij action at j (note that we 
switch i and j relative to the code in Fig. 3 to continue referring to the interaction of the 
sender i and receiver j). The component W contains a subset of node identifiers from 
j’s world. Hence W is always used to update world j, ig{i)j .wk, and ig{i)j .w-ua. The 
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update of world j is identical to that in L-Rambo. By definition ig{i)j -wk is the set of 
node identihers that j is assured that i has, hence we update it with information in W. 
Similarly, by definition ig{i)j.w-ua is the set of node identifiers that j is waiting for i to 
acknowledge. It is possible that i has learned some or all of this information from other 
nodes and it is now a part of W , hence we remove any identifiers in W that are also in 
ig{i)j -w-ua from ig{i)j .w-ua', these identifiers do not need further acknowledgment. 

What happens next in the effect of recv depends on the value of pnr (the phase num- 
ber that i believes j to be in). First, if pnr < ig{i)j .p-ack, this means that this message 
is a late message since there must have been a prior message from i to j that included 
this or higher pnr. Hence, no updates take place. Second, if pnr > ig{i)j .p-ack, this 
message is considered to be an acknowledgment message. By definition ig{i)j .p-ack 
contains the phase number of j when last acknowledgment from i was received. Fol- 
lowing last acknowledgment, phase number of j was incremented, ig{i)j .p-ack was 
assigned the new value of phase number of j, and lastly new set of identifiers to be 
propagated was recorded. Since node i replied to j with phase number larger than 
ig{i)j .p-ack it means that j and i exchanged messages where i learned about the new 
phase number of j, by the same token i also learned the information included in these 
messages. (We show formally that ig{i)j .w-ua is always a subset of each message com- 
ponent W that is sent to i by j.) Hence, it is safe for j to assume that i at least received 
the information in ig{i)j .w-ua and to add it to ig{i)j.wk. 

Since the choice of i and j is arbitrary, gossip from j to i is defined identically. 
Atomicity of LL-Rambo. We show that any trace of LL-Rambo is a trace of L- 
Rambo, and thus a trace of Rambo. We start by defining the remaining history vari- 
ables used in the proofs. These variables are annotated in Fig. 3 with a [h] symbol. 

- For every tuple G h-msg^, where m = {W, D,v,t,cm,pns,pnr) and 

pns = p, the history variable hs-W {i,j ,p) is a mapping from / x / x IN to 
2^ U {-L}. This variable records the world component of the message, W, when i 
sends message m to j, and z’s phase number is p. Similarly, we define a derived 
history variable hs-D{i,j, p), a mapping from / x / x IN to 2^ U {-L}. This history 
variable records the departed component of the message, D, when i sends message 
m to j, and z’s phase number is p. 

Now we list history variables used to record information for each send((LF, D, v, t, cm, 
pns,pnr))ij event. 

- Each of the following variables is a mappings from / x / x IN to 2^ U 
{±}. hs-world{i,j ,pns) records the value of worldi, hs- departed {i,j, pns) 
records the value of departedi, hs-wk{i,j , pns) records the value of ig{j)i.wk, 
hs-dk{i,j ,pns) records the value of ig{j)i.dk, hs-wua{i,j , pns) records the value 
of ig{j)i. w-ua, and hs-dua{i,j,pns) records the value of ig{j)i.d-ua. 

- hs-pack{i,j , pns) is a mapping from I x I x IN to IN. It records the value of 
ig{,j)i. p-ack. 

The last history variables record information in messages at each recv((LF, D, v, t, cm, 
pns,pnr))j^i event. 

- Each of the following is a mapping from I xl x IN to 2-^ U {-L}. hr- W {j, i, pns) 
records the component W {world) and hr-D{j, i, pns) records the component D 
{departed). 
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We begin by showing properties of messages delivered by Reader-Writer processes. 

Lemma 2. Consider a step (s,e, s') of an execution a o/LL-Rambo, where e = 
rec\/{{W,D,v,t,cm,pj,pi))j^ifori,j G I, andpi > s.ig{j)i.p-ack. Then, 

(a) s.ig{j)i.p-ack = s.hs-pack{i,j,pi), (b) s.ig{j)i.w-ua C s.hs-wua{i,j,pi), 

(c) s.ig{j)i.d-ua C s.hs-dua{i,j,Pi). 

Invariant 1 is used in proving the correctness of LL-Rambo. In Invariant I, parts 
(a) to (e) and Lemma I are used to show the key parts (f) and (g). 

Invariant 1 For all states s of any execution a o/LL-Rambo.' 

(a) {(yV, D, V, t, cm,pns,pnr) ,i, j) G s.h-MSG C s.worldiAD C s. departed^, 

(b) \f i,j G I : s.ig{j)i.w-ua C s.worldi — s.ig{j)i.wk, 

(c) \/i,j G I : s.ig{j)i.d-ua C s.worldi — s.ig{j)i.dk, 

(d) {{W, D,v,t, cm,p,pnr) ,i, j) G s.h-MSG s.hs-wua{i,j, p) C W, 

(e) {{W, D, V, t, cm,p,pnr), i,j) G s.h-MSG s.hs-dua{i,j,p) C D, 

(f) ^ hi G I : s.ig{j)i.wk C s.worldj, and 

(g) '^ hi G I • s.ig{j)i.dk C s. departed j . 

Parts (f) and (g) of Invariant I show that no node overestimates the knowledge 
of another node about its world and departed sets. Finally we show the atomicity of 
objects implemented by LL-Rambo by proving that it simulates L-Rambo, i.e., by 
showing that every trace of LL-Rambo is a trace of L-Rambo (hence of Rambo). 

Theorem 2. LL-Rambo implements atomic read/write objects. 

Proof. (Sketch) We define a relation R to map (a) a state t of LL-Rambo to a state 
s of L-Rambo so that every “common” state variable has the same value (e.g., for 
i G I, t.worldi = s.worldi, t.pnumli = s.pnumli, etc.) and (b) a message 
m = (yV, D, V, t, cm,pns,pnr) in the Channel automaton of LL-Rambo to a mes- 
sage m' = {W, D, V, t, cm,pns,pnr) in the Channel automaton of L-Rambo so that: 
m.v = m'.v, m.t = m! .t, m.cm = m' .cm, m.pns = m' .pns, m.pnr = m'.pnr, 
m.W = hs-world{i,j ,pns) — hs-wk{i , j , pns) and m'.W = hs-world{i,j,pns), and 
m.D = hs- departed {i, j , pns) — hs-dk{i, j , pns) and m'. 19 = hs-departed{i, j , pns) . 
Using the specifications of the two algorithms and Invariant 1, we show that R is a sim- 
ulation mapping from LL-Rambo to L-Rambo. Since L-Rambo implements atomic 
objects per Theorem 1, so does LL-Rambo. 



5 LL-Rambo Implementation and Performance 

We developed proof-of-concept implementations of Rambo and LL-Rambo on a 
network-of- workstations. In this section we presents preliminary experimental results 
and overview conditional analysis of algorithms. 

Experimental Results. We developed the system by manually translating the In- 
put/Output Automata specification to Java code. To mitigate the introduction of er- 
rors during translation, the implementers followed a set of precise rules that guided the 
derivation of Java code. The platform consists of a Beowulf cluster with ten machines 
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running Linux. The machines are various Pentium processors up to 900 MHz inter- 
connected via a 100 Mbps Ethernet switch. The implementation of the two algorithms 
share most of the code and all low-level routines, so that any difference in performance 
is traceable to the distinct world and departed set management and the gossiping disci- 
pline encapsulated in each algorithm. 

We are interested in long-lived applications and we assume that the number of par- 
ticipants grows arbitrarily. Given the limited number of physical nodes, we use majority 
quorums of the these nodes, and we simulate a large number of other nodes that join the 
system by including such node identifiers in the world sets. Using non-existent nodes 
approximates the behavior of a long-lived system with a large set of participants. How- 
ever, when using all-to-all gossip that grows quadratically in the number of participants, 
it is expected that the differences in Rambo and LL-Rambo performance will become 
more substantial when using a larger number of physical nodes. 

The experiment is designed as follows. There are ten nodes that do not leave the 
system. These nodes perform concurrent read and write operations using a single con- 
figuration (that does not change over time), consisting of majorities, i.e., six nodes. 
Figure 4 compares (a) the average latency of gossip messages and (b) the average la- 
tency of read and write operations in Rambo and LL-Rambo, as the cardinality of 
world sets grows from 10 to 7010. 

LL-Rambo exhibits substantially better 
gossip message latency than Rambo (Fig. 4(a)). 

In fact the average gossip latency in LL-Rambo 
does not vary noticeably. On the other hand, the 
gossip latency in Rambo grows substantially as 
the cardinality of the world sets increases. This 
is expected due to the smaller incremental gos- 
sip messages of LL-Rambo, while in Rambo, 
the size of the gossip messages is always pro- 
portional to the cardinality of the world set. LL- 
Rambo trades local resources (computation and 
memory) for smaller and fewer gossip messages. 

We observe that the read/write operation latency 
is slightly lower for Rambo when the cardinal- 
ity of the world sets is small (Fig. 4(b)). As the 
size of the world sets grows, the operation la- 
tency in LL-Rambo becomes substantially bet- 
ter than in Rambo. 

Performance analysis. We briefly summarize 
Rambo. Here we assume that the participating nodes perpetually gossip with a 
period d. We show that the latency of read and write operations matches that of 
Rambo [10,6]. Specifically, if d is the maximum message delay, then read and write 
operations take at most 8d time, when reconfigurations are not too frequent. We analyze 
the communication of LL-Rambo in the following scenario. We consider the scenario 
where, once an object is created, several nodes join the system, such that, together with 
the creator, there are n nodes. Then I nodes leave the system, such that the number of 




Fig. 4. Preliminary empirical results: 
(a) gossip message latency, (b) read and 
write latency. 



our performance analysis of LL- 
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remaining active nodes is a = n — I . We show that in this case, after r rounds of gossip, 
the savings in gossip messages for LL-Rambo are between I7(r • n) and 0{r ■ n^). 

6 Discussion and Future Work 

We presented an algorithm for long-lived atomic data in dynamic networks. Prior solu- 
tions for dynamic networks [10, 6] did not allow the participants to leave gracefully and 
relied on gossip that involved sending messages whose size grew with time. The new 
algorithm, called LL-Rambo improves on prior work by supporting graceful depar- 
tures of participants and implementing incremental gossip. The algorithm substantially 
reduces the size and the number of gossip messages, leading to improved performance 
of the read and write operations. Our improvements are formally specified and proved. 
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Abstract. We stndy the problem of designing fanlt tolerant rontings 
in complete and complete bipartite optical networks. We show that this 
problem has strong connections to varions fnndamental problems in de- 
sign theory. Using design theory approach, we find optimal /-tolerant 
arc-forwarding indexes for complete networks of a prime power order, 
and all complete bipartite networks. Similarly, we find almost exact val- 
nes for /-tolerant optical indexes for these networks. Onr work motivates 
an interesting relaxation of an extensively stndied problem on mutually 
orthogonal Latin squares. 



1 Introduction 

Routing communication demands is one of the fundamental problems in the area 
of networking. One of the most recognized recent applications of the problem is 
in the area of optical networking [1]. Most of the research concentrates on deter- 
mining two basic invariants of a given optical network - the arc-forwarding and 
optical indexes [3] . In [8] , fault-tolerant issues of optical networks were considered 
and the two invariants were generalized into so called f -tolerant arc-forwarding 
and f -tolerant optical indexes. The parameter / represents the number of faults 
that are tolerated in the optical network. To determine the arc-forwarding index 
of a network, one has to design a delicate path system which uses every link 
in the network evenly. Unfortunately, since most of the known constructions of 
such systems are ad hoc, possible connections of this problem with other areas 
of computer science and mathematics remain hidden. 

The aim of this paper is twofold. First, we try to understand why path layout 
problems are hard even for very simple networks. In particular, we explore con- 
nections between the construction of path layouts in simple networks and some 
hard problems in design theory. This leads us to postulate an interesting con- 
jectures on Latin squares. Second, using tools from design theory, we determine 
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/-tolerant arc-forwarding and (almost determine) /-tolerant optical indexes of 
complete and complete bipartite networks for many values of /. 

We model an all-optical network as a symmetric directed graph G with vertex 
set V{G) and arc set A{G), i.e., if (u,u) G A{G) then {v,u) G A{G). Let P{u,v) 
denote a directed path in G from u to v. The set 



= {Pi{u,v) : (u,v) G V{G), u^v, i = 0,...,f} 

where for each ordered pair (u,v) G V{G) x V{G) of distinct nodes, the paths 
Pq{u,v), . . . , Pf{u, v) are internally node disjoint is called f- fault tolerant rout- 
ing. As noted in [2], when links and/or nodes may fail in a network, it is im- 
portant to establish connections for a required communication demand, that 
guarantee fault-free transmission. Assuming that at most / links/nodes may fail 
in the network, the set TZf{G) will obviously provide such a routing. The routing 
TZf{G) can be viewed as an embedding of a complete /-multi-digraph on n nodes 
into G where n is the cardinality of V (G) . Most of the known work concentrated 
on 0-fault tolerant routings TZo{G). 

In practical applications the number of different signals a link can carry, is 
limited. Therefore, the goal is to design a routing which minimizes the maximum 
load on arcs. Let Tr{TZf{G)) denote the maximum load on arcs, that is, the 
maximum number of times an arc of G appears in directed paths of TZf{G). 
Then 

7t/(G) = min 7r(7^/(G)) 

is called the f -tolerant arc-forwarding index of G. Note that the 0-tolerant arc- 
forwarding index is the well-known arc- forwarding index, cf. [9] . 

In an optical network, a request is serviced by sending a signal on a specific 
wavelength over the entire path assigned to the request by the routing. Therefore, 
whenever two paths of the routing share an arc, they must be assigned to different 
wavelengths. Let w{TZf{G)) be the smallest number of wavelengths needed to 
assign to the directed paths of TZf{G) so that no two paths that share an arc 
receive the same wavelength. Then 



Wf{G)= mm w{nf{G)) 

K-syG) 

is called the f -tolerant optical index of G. Again, the 0-tolerant optical index is 
equivalent to the well-studied optical index. 

Consider an /-fault tolerant routing TZf{G). We say that the routing TZf 
is optimal if its congestion achieves the /-tolerant arc-forwarding index 7ry(G), 
and it is balanced if the difference between congestions of any two arcs is at 
most one. For any z = 0, . . . , /, the level i of the routing 7^/ is the set of paths 
Pi{u,v) G TZf, for all M yf v. It follows that for any f < f, the subrouting 
TZf'{G) consisting of levels 0, . . . , /', is /'-fault tolerant. We say that an optimal 
balanced routing TZf{G) is levelled if every one of its subroutings is also optimal 
and balanced. A feature of a levelled /-tolerant routing is that it can easily be 
reconfigured when there is a change in the requirement on the number of faults 
the routing should tolerate. 
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In subsequent sections, we establish a close connection between levelled /- 
fault tolerant routings in the complete digraph and the complete bipartite 
digraph Kn,n and the existence of / + 1 mutually orthogonal Latin squares. 
Since the later is a very hard problem, this indicates that determining 7 t/(G) 
for / > 0 is much harder than determining 7 To(G) even for very simple graphs 
G. Using tools from design theory, we calculate TVf{Kn) for every / up to the 
maximal possible value n — 2 when n is a power of prime, while for other n, the 
maximal value of / depends on some number-theoretic properties of n. Further, 
we calculate TTf(Kn^n) for every / up to its maximal value n — 1. We also 
determine almost optimal values for the /-tolerant optical indexes for these 
graphs. We believe that this design theory approach can also be used for other 
graphs G. Note that design theory approach has already been used in a related 
problem - minimizing the number of ADM switches in WDM ring networks, see 
for example [4]. 



2 Preliminaries 



Let p be a path in a graph G. The length of p is the number of arcs of p. The 
distance of two nodes u and u of G is the length of a shortest path connecting 
u and V. We generalize distance as follows. The k-distance of u and v, denoted 
by dk(u,v), is the minimal sum of lengths of k internally node disjoint paths 
connecting u and v. 

We start with the following obvious lower bound for the /-fault tolerant 
forwarding index. 



Proposition 1. For any graph G = (V, E) and f < \V\ — 2, 



^f{G) > 




^ df+i{u,v) 

u,v^V 



Let G be a symmetric digraph. To upper bound the optical index of the 
graph G, take an /-fault tolerant routing TZ = TZf{G) of G. We can construct 
an undirected path graph Q-j^ = G{71, E{TZ)) as follows: two paths P,P'€TZ are 
connected by an edge if P and P' share an arc. Obviously, the minimum number 
of wavelength needed for TZ is the same as the chromatic number of the path 
graph Qji which is at least 7ry(G). Hence, we have the following lower bound for 
the optical index. 



Proposition 2. For any symmetric digraph G with connectivity k, and any 

0 < / < fc, 

7t/(G) < Wf{G). 

It is a well known open problem for 0-fault tolerant routings to show that 
7 To(G) = Wo{G) for any symmetric digraph G. This is the case for many exten- 
sively studied interconnection networks and was recently also proved for sym- 
metric trees, cf. [7]. We believe the same relation is true for /-tolerant routings 
and repeat here our conjecture from [8]. 
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Conjecture 1. Let G be a symmetric digraph with connectivity k. For any 0 < 
f < k, 

TTf(G) = Wf(G). 



3 Complete Digraphs 

In this section, we consider complete digraphs Kn and determine their /-tolerant 
arc-forwarding and (almost determine) their /-tolerant optical indexes. This is 
achieved by reformulating these problems as design theory problems. In par- 
ticular, we show that these problems are very closely related to the existence 
of mutually orthogonal Latin squares, and propose some interesting conjectures 
about Latin squares. 

Consider a complete digraph K„ with V (Kn) = V where C is a set of 
cardinality n and an /-fault tolerant routing TZf{Kn)- Necessarily, / < n — 2. 

Take a pair of distinct nodes u,v € V. Consider the / -I- 1 paths connecting 
u and V. At most one of these paths can have length 1. Therefore, d/_|-i(M, u) > 
2/ -|- 1. Since the number of pairs (u,v) is the same as the number of arcs, by 
Proposition 1, 7 t/(1T„) > 2/ -|- 1. 

In what follows, we would like to design a levelled /-fault tolerant routing 
attaining this lower bound. However, this is not always possible. For example, an 
exhaustive computer search has shown that there is no levelled /-fault tolerant 
routing for n = 6 and / > 2. 

First, we will show that for every prime power n there is a levelled (n— 2)-fault 
tolerant routing. 

Let us start with three definitions on Latin squares. 

Definition 1. A Latin square of order n is an nx n array with entries from a 
set S of cardinality n such that every row (resp. column) contains each symbol 
of S exactly once. 

Formally, a Latin square of order n is a pair {S, L) where L is a mapping 
L : S X S ^ S such that for any u,w G S, the equation 

L{u,v) = w (resp. L{v,u) = w) 
has a unique solution v G S. 

In other words the row mapping 

: S ^ S defined as 

Ru{v)=L{u,v) and C(({v) 
are permutations. 

A Latin square is normally written as an n x n array for which the cell in 
row u and column v contains the symbol L(u,v). 

Definition 2. We say that a Latin square (S,L) is idempotent if for every 
u G S, L{u, u) = u. 



: S —>■ S and the column mapping 
= L{v, u) for all u,v G S, 
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Definition 3. We say that two Latin squares {S,Li) and (S', L 2 ) are indepen- 
dent if for all u^v, Li{u,v) ^ L 2 {u,v). 



Note that the property of independence of two idempotent Latin squares 
(S, Li) and (S, L 2 ) is weaker than the property of orthogonality which has been 
studied extensively, cf. [6]. Two Latin squares (S, Li) and ( 5 ,^ 2 ) are orthogonal 
if 

{[Li{u,v), L2{u,v)]-, u,v G S} = S X S . 



Indeed, orthogonal idempotent Latin squares (S, Li) and (S, L 2 ) are also 
independent, since if Li{u,v) = L 2 {u,v) = w for some u^v then 



[Li{u,v),L2{u,v)] = [w,w] = [Li{w,w),L2{w,w)] 



which contradicts orthogonality. 

It has been shown in [12] that there are no 2 orthogonal Latin squares of order 
6. Note that it is a fundamental problem of design theory to find the maximum 
number of mutually orthogonal Latin squares for each order n. On the other 
hand, it is not hard to find two independent idempotent Latin squares of order 
6. Hence, it should be easier to find n — 2 mutually independent idempotent 
Latin squares than mutually orthogonal Latin squares. 

If n is a prime power, an elementary construction in [11] shows that there 
exist n — 2 mutually orthogonal idempotent Latin squares of order n, and hence 
n — 2 mutually independent idempotent Latin squares. 

Lemma 1 ([11]). Let n = be a prime power. Then there exist n — 2 mutually 
orthogonal idempotent Latin squares. 

Let us recall the construction. 



Construction 1. Let (F, •, -I-) be a finite field of order n = p'’. Then for every 
t € F — {0, 1} and u, u G F define 

L*{u,v) = i ■ u + (1 — i) ■ V . 

Theorem 1. Lf n is a prime power we can construct a levelled {n — 2)-fault 
tolerant routing for K„ in polynomial time. Ln particular, TVf{Kn) = 2f +1 for 
all f < n — 2. 

Proof. Assume that n and / are chosen so that there exists a levelled /-fault 
tolerant routing TZf{Kn) with congestion 2/ -|- 1. We show that such routing 
exists when n is a prime power and / < n — 2. Obviously, the subrouting TZg^Kn) 
of TZf{Kn) contains only paths of length 1, and each consecutive level contains 
only paths of length 2. Therefore the levelled routing TZf{Kn) can be described 
using a system of functions T) : VxV^V,i=l,...,f defined as the union 
of the following sets: 

Vo = U„^„{Po(u, v)=u^ u}, 

Vi = \Juiiv{Pi{u,v) = Fi{u,v) u}, for every z = 1 , . . . , /, 



(P) 
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where the system of functions {-FijiLi satisfies conditions (1), (2) and (3) de- 
scribed bellow. 

First, we have to guarantee that the Pi’s are paths, i.e. that for all i and for 
all u =/= V, 

F^{u,v)^u,v. ( 1 ) 

Next, for every i = 1, . . . , / and every pair of distinct nodes u,v € V, consider 
all paths of level i using the arc u v. Since, our system is levelled, there are 
exactly two such paths. We can divide them into two groups: = 

{Pi{u,z)] Fi{u,z) = u} containing the paths which use the arc u ^ v as the 
first arc of the path, and ’{u,v) = {Pi{z,v)] Fi{z,v) = u} containing the 
paths which use the arc u v as the second arc. The second condition is that 

|M«(u,u)| + |Mf)Ku)| = 2 (2) 

for every u v &V and i = 1, . . . , /. 

Finally, we have to ensure that the paths from u to u on different levels are 
internally node disjoint: for every u ^ v and i ^ j, 



F,{u,v) ^ Fj{u,v) . (3) 

To find the required routing TZf(Kn), it is enough to find a system of func- 
tions {Fi}(^-^ satisfying conditions (1), (2) and (3). To make the system more 
symmetric, and hence easier to design, we will replace condition (2) by the 
stronger condition that 



|M«(u,z;)| = |Mf (u,u)| = l (2’) 

for all u,v and i. Note that we no longer require u=/= v. Conditions (2’) and (1) 
imply that 

F^{u,u)=u (F) 

for all u G V. On the other hand, conditions (2’) and (F) imply (1). Hence, it is 
enough to find a system of functions {Fi}(^-^ satisfying conditions (!’), (2’) and 
(3). Such a system of functions corresponds to a sequence of / mutually inde- 
pendent idempotent Latin squares. This follows from the following two claims. 

Claim. If a system of functions satisfies condition (2’) then {V, Fi) is a 

Latin square for each i = 1, . . . , /. 

Proof. Fix an i. For all u, v we have the following two implications: 

\M-^\u,v) \ = 1 there exists exactly one z such that Fi{u,z) = v, 

\M- Tu,u)| = 1 => there exists exactly one z such that Fi{z,v) = u, 

which implies that (V, Fi) is a Latin square. 
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Claim. The system of functions which satisfies conditions (!’), (2’) and 

(3) corresponds to the sequence of / mutually independent idempotent Latin 
squares. 

Proof. Obviously, for a system of functions {TijiLi which satisfies (!’) we have 
that (S', Fi) is an idempotent Latin square, for every i = 1, . . . , 

Obviously, for a system of functions which satisfies (3), we have that 

the sequence of Latin squares {S, Fi), . . . , {S, Ff) is mutually independent. 

Now, the proof of the theorem follows from Claim 3 and Lemma 1 immedi- 
ately. 

If n is not a prime power then there may not exist n — 2 mutually independent 
idempotent Latin squares of order n. We observed by an exhaustive computer 
search that: 

— for small even n’s which are not powers of 2 , there do not exist n — 2 such 
Latin squares; 

— for small odd n’s, there exist n — 2 such Latin squares, and the number of 
such sequences grows very quickly. 

This fact motivates us to pose the following conjecture. 

Conjecture 2. For each odd n > 3, there exist n — 2 mutually independent idem- 
potent Latin squares of order n. 

In the full version, we propose an approach for solving the conjecture and 
reformulate it as a very interesting problem about permutations. For general n 
we can use the direct product composition of Latin squares to construct p’’ — 1 
mutually orthogonal Latin squares, where p'" is the smallest maximal prime 
power dividing n, i.e, p’' = min{p(^CP 2 ^, . . . ,p^)f}, where n = p^plf^ . . .p]f and 
pi,P 2 , . . . ,Pk are distinct primes, cf. [10]. Using the same technique we can give 
a similar result directly formulated for mutually independent idempotent Latin 
squares. Before proving the result we need the following definition. 

Definition 4. The direct product Ax B of two Latin squares of orders m and 
n respectively is the Latin square of order mn defined by 

{A X B){[ai,bi], [ 02 , 62 ]) = [A(oi, 02 ), 5 ( 61 , 62 )] . 

Lemma 2. Let Ai, . . . , A^ be k mutually independent idempotent Latin squares 
of order m and let B\, . . . , Bk be k mutually independent idempotent Latin square 
of order n. Then Ai x B\, . . . ,Ak x B^ are k mutually orthogonal idempotent 
Latin squares of order mn. 

The proof will appear in the full version. 

Now, we can combine Lemma 2 with Claim 3 to obtain the following. 

Theorem 2. Let n = p^ ■■■Pk’ where p\,...,pk are distinct primes and let 
p’’ = min{pl[\ . . . ,p]f}. We can construct a levelled {p’’ — 2) -fault tolerant routing 
for Kn in polynomial time. Ln particular, 7 ry(iT„) = 2/ -|- 1 for all f < p’^ — 2. 
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In most practical applications this is sufficient, since an /-tolerant system 
requires at least 2f+l wavelengths to send on one optical link but the bandwidth 
of an optical cable is limited. 

3.1 An Upper Bound for the Optical Index 

Consider a levelled /-fault tolerant routing TZf(Kn) = described in (P), 

where Fi, . . . , Ff are independent idempotent Latin squares. We will consider the 
subroutings Vq, ... ,Vf separately. For each we build the path graph and upper 
bound its chromatic number. Then, by assigning different sets of wavelength for 
the paths in different subroutings, we have the following upper bound for the 
wavelength number of TZf(Kn): 



f 

wiUfiKr,)) <J2x{Gv,) ■ 

i=0 

Lemma 3. Consider a levelled f -fault tolerant routing described in (P), 

where f < n — 2 and F\, . . . ,Ff are independent idempotent Latin squares. Then 
x(Gvo) = 1; and for every z = 1, . . . , /, xiGvi) < 3. 

Proof. The path graph Q-pg contains no edges, hence its chromatic number is 1. 
Consider the path graph Qp^ for some i = 1, . . . , /. By (2), the degree of each 
node of fj-p. is exactly 2, i.e., the path graph is a collection of cycles. Obviously, 
the path graph can be coloured using 3 colours. 

Note that it can happen that the cycles of the path graph are of odd lengths, 
hence two colours are not enough to color the path graph Qp. . We have the 
following upper bound for the optical index of K„. 

Theorem 3. Let n = p^^ where pi,...,pk are distinct primes and let 

p^ = min{p\^ , . . . ,p^*‘}. For every f = 0, . . . , (p’’ — 2), there exists a levelled 
f -fault tolerant routing TZf{Kn) with wavelength number w{TZf{Kn)) < 3/-I- 1. 
Consequently, 

2/+l<my(iT„) <3/+l. 

4 Complete Bipartite Digraphs 

Consider a complete bipartite digraph Kn^n with node partitioning U = {uq, ■ . ■ , 
M„_i} and V = {vo, . . . ,Vn-ij. Let TZf{Kn,n) be an /-fault tolerant routing. 
Necessarily, f < n — 1. 

Take a pair of distinct nodes x,y G U LI V. If x and y belong to the same 
partition then every path from x to y has length at least 2, and thus we have 
df+i{x,y) > 2{f -\- 1). If X and y belong to different partitions then there is 
only one path connecting x and y of length 1, and all other paths have length 
at least 3. Hence, in this case, d/+i(a;,y) >3/4-1. By Proposition 1, we have 
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TVf{Kn,n) > (2n(n - 1) • 2(/ + 1) + 2n^(3/ + 1)) 



f5/ + 3, 

hf + 2, 

[5/ + I, if/ = n-l. 



We will show a construction of an {n — l)-fault tolerant levelled routing 
achieving this lower bound. We split the paths of the routing into the following 
sets of paths 

{Ua; 

x^y 

Bo = 

x,y 

Bi = 

x,y 

where Ai, A[, Bi,Ci, B[,C[ are mappings Z„ x Z„ ^ Z„. We choose Ai, A[, 
Bi, Ci, B'-, Cl so that both sets A = Ui^z^Ai and B = Ui^z„Bi will be levelled 
routings for {Kn^n,UxUUV xV) and {Kn,n,U xVUV x [/), respectively, where 
SxS = {(s, t) : s,t G S, s yf t}. The symbol (G, W) denotes the communication 
demand in which only pairs of nodes in W needs to be joined by a directed path. 
The following lemma describes the set of paths A. 

Lemma 4. There exists a levelled (n — \)-fault tolerant routing A = 
where 

Ai {Ux ^ ’^Ai{x,y) ^ '^x ^ ’^A[{x,y) ^ ^ 0, . . . , 7T 1, 

Xiiy 

connecting nodes from the same partitions of Kn^n which can be constructed in 
quadratic time. In particular, mappings Ai and A' can be chosen in the following 
way, for all i = 0, . . . ,n — 1, 



^ ‘^Ai{x,y) ^ '^yi '^x ^ '^A',{x,y) ^ 

Vy, Vx ^ Ux} , 

'^Bi(x,y) ^ ^Ci{x,y) ^ '^yi ^x ^ ’^B[{x,y) ^ ’^C[{x,y) ^ 



Ai{x, y) = L{-x, -y) + (f){i) 

Al{x, y) = L'{-x, -y) + (f{i) + 1, where 



(p{i) 



2t + 1 if n is even and * > f , 
2i otherwise. 



and (Zn,L) and {Zn,L') are two idempotent Latin squares. 

The proof will appear in the full version. A very simple construction of idem- 
potent Latin squares of order n > 3 is presented in [10]. 

Next lemma describes the set of paths B. We will need the following definition. 
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Definition 5. A Latin square is a column-normalized, if its columns are sorted 
in increasing order by the values in the first row. 

Lemma 5. We can construct a levelled {n — l)-fault tolerant routing 
B = Ui^z„Bi, where 

^0 = ^ Vy, V^ Ux} , 

x,y 

Bi ^ ^ '^Ci(x,y) ^ Vx ^ ^ ^Cl{x,y) ^ 

connecting nodes from different partitions of Kn,n in quadratic time. In partic- 
ular, mappings Bi,Ci, can he chosen as follows. ■ 

Bi{x,y) = LB{i,y), Ci{x,y) = Lc{i,x), 

B'fix,y) = LB'{i,y), C[{x,y) = Lc{i,x), 

where (Zn,LB), (Zn,Lc), (Z„,Lb/), (Zn,Lc) are any column-normalized Latin 
squares. 

The proof will appear in the full version. 

Lemmas 4 and 5 give the main result of this section. 

Theorem 4. We can construct a levelled (n— 1) -fault tolerant routing for Kn^n 
in quadratic time. In particular, for all f < n — 1, 

f5/ + 3, z//<|-l, 

7r/(iTn,„) = < 5/ -h 2, i/f-l</<n-l, 

[sZ-hl, iff = n-l. 



4.1 An Upper Bound for the Optical Index 

Consider a levelled /-fault tolerant routing TZf{Kn,n) as described in Lemmas 4 
and 5. We will consider the subroutings Ao, - . ■ ,Af and Bq,. ■ .,Bf separately. 
For each we build the path graph and upper bound its chromatic number. Then, 
by assigning different sets of wavelength for the paths in different subroutings, 
we have the following upper bound for the wavelength number of TZf{Kn^n)' 

f f 

w{Kx,,n,D,'R.f{Kr,,u)) < '^X{GAi)+'^X{GBi) ■ 

i—0 i—0 

For set of paths Ai, we show that the corresponding path graph is bipartite. 
For set of paths Bi we first derive a simple upper bound on the maximum degree 
of the corresponding path graph. Next we show that by a special choice of Latin 
squares (used in construction of the system) this upper bound can be improved. 
Upper bounds on optical indexes will follow from Brook’s theorem which gives 
an upper bound on chromatic number in terms of maximum degree of the graph. 
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Lemma 6. Consider a levelled (n — \)-fault tolerant routing described 

in Lemma 4- For every i = 0, . . . , n — 1, xiGAi) < 2. 

Proof. It is enough to prove that the path graph Q_^. is a bipartite graph. We 
have two types of paths: C/C/-paths which start and end in U and FF-paths 
which start and end in V. It is easy to see that there are no edges between the 
paths of the same type. Indeed, assume for instance that for x\ ^ y\, X 2 ^ 
U 2 , (xi.yi) yf (X 2 ,y 2 ), the paths VL{n-xi,n-yi)+ 4 ,{z) Uyi and Ux 2 

V L{n-x- 2 ,n-y 2 )+ 4 >(i) ^V 2 share the first arc. Then x\ = X 2 = n — x and L{x, n — 

yi) = L{x,n — y 2 ). Since L is a Latin square, yi = y 2 , a contradiction. 

Hence, for paths in Ai we need 2(/ + I) wavelengths. This is optimal for 
f < n/2 — 1, and differs from the optimal value by at most two for other values 
off. 

Lemma 7. Consider a levelled {n — 1) -fault tolerant routing described 

in Lemma 5. Then x{Gbo) = 1; o.nd for every i = 1, . . . ,n — 1, xi^Bi) < 6. 

The proof will appear in the full version. 

This does not give a very good upper bound for the optical index. The con- 
gestion induced by paths in Bi is 3, therefore the colouring we found might be 
twice the optimal. To improve the colouring we will further restrict the class 
of routings, by taking special mappings Bi,Ci,B[,Cf We need the following 
definition. 

Definition 6. Let {S,L) be a Latin square. The row-inverse of (S,L) is a pair 
{S, L), where L : S x S ^ S is a mapping such that L{u, v) = (B^)~^(v). Recall 
that the row mappings are bijective. 

The next lemma checks that the row-inverse of a Latin square is also a Latin 
square. 

Lemma 8. Let (S,L) be a Latin square. The row-inverse {S,L) is a Latin 
square. Lf(Zn,L) is column-normalized then so is (Zn,L). 

Proof. Obviously, the row mappings of (S', L) are permutations. Hence, assume 
that for some x,x' ,y € S, x yi x', we have L{x,y) = L{x',y). Then (R^)~^{y) = 
{Rx>)~^(y) = z. This yields 

Rx(z) = R^{{R!f)~Hy)) = y = Rx'i.z) . 

i.e., L{x,z) = L{x' ,z), a contradiction. 

The second part of the lemma follows by a simple observation that a Latin 
square is column-normalized if and only if the row mapping R^ is the identity 
mapping. 

Lemma 9. Consider a levelled {n — l)-fault tolerant routing Ui^z„Bi described 
in Lemma 5 such that Lb' is a row-inverse of Lc and Lc is a row-inverse of 
Lb (the column-normalized Latin squares Lb and Lc can be chosen arbitrary). 
Then for all i = 1, . . . , n — 1, xi^Bi) < 4. 
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The proof will appear in the full version. 

As a conclusion we have the upper bound for the /-tolerant optical index of 

^ n^n • 

Theorem 5. For every f = 0, ...,n — 1, there exists a levelled f -fault toler- 
ant routing TZ for Kn,n with the wavelength number w{Kn,n,D,TZ) < 6/ -I- 3. 
Consequently, 

5/+l<tn/(iT„,„)<6/ + 3. 
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Abstract. In this paper we consider the minimum feedback vertex set 
problem in graphs, i.e., the problem of finding a minimal cardinality 
subset of the vertices, whose removal makes a graph acyclic. The prob- 
lem is AfP-hard for general topologies, but optimal and near-optimal 
solutions have been provided for particular networks. We improve the 
upper bounds of [11] both for the two-dimensional mesh of trees, and 
for the pyramid networks. We also present upper and lower bounds for 
other topologies: the higher-dimensional meshes of trees, and the trees 
of meshes networks. For the two-dimensional meshes of trees the results 
are optimal; for the higher-dimensional meshes of trees and the tree of 
meshes the results are asymptotically optimal. For the pyramid networks, 
the presented upper bound almost matches the lower bound of [11]. 



1 Introduction 

The Problem. For a graph G = (V,E), a subset V GV \s called a feedback 
vertex set, if the subgraph of G induced hy V — V has no cycles. If V has 
minimum cardinality among all such subsets, then V is called a minimum vertex 
feedback set and is denoted by V. In this paper we study the minimum feedback 
vertex set problem (MFP) which is the problem of finding minimum feedback 
vertex sets. 

Old and New Results. MFP is known to be AfP-hard for general networks 
[6], but optimal and near-optimal solutions have been provided for particular 
topologies [1, 3-5, 7, 10, 11]. 

In this paper we consider MFP for certain undirected graphs of bounded 
degree, namely two- and higher-dimensional meshes of trees networks, trees of 
meshes and pyramid networks [2,8,9,13-16]. Upper and lower bounds for the 
two-dimensional mesh of trees and pyramid networks have been presented in 
[11]. For the two-dimensional mesh of trees we improve the upper bound of [11] 
obtaining a tight bound. Using a different construction, asymptotically optimal 
results are obtained for all higher dimensions. We then consider a topology for 
which MFP was not been studied before, the tree of meshes TMn, and we show 
that at least 2/3 • • logn — 5/9 ■ + n — 4/9 vertices must be removed and 



R. Kralovic and O. Sykora (Eds.): SIROCCO 2004, LNCS 3104, pp. 209-220, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 




210 Flaminia L. Luccio and Jop F. Sibeyn 



construct a set with \V^\ = 2/3 • • logn — 25/72 • + 2/9. Finally, we 

consider the pyramid networks and we improve the upper bound of [11] using a 
construction that is quite involved. Nevertheless we do not get asymptotically 
tight results: there is a ratio 135/128 ~ 1.055 between upper and lower bounds. 

When considering lower bounds, we will refer to a result of [1] applied to 
undirected graphs in which the maximum degree of the vertices is r. The number 
\V\ of vertices to remove is: \V\ > [(jifl — \V\ + l)/(r — 1)] . (1) 

2 Meshes of Trees 

2.1 Two-Dimensional Meshes of Trees 

Definition 1. A pxg mesh is a graph Mp^q = {V, E) with Vij & V , 0 < i < p—l, 

0 < J < 9 ~ 1; 0 ,'nd E contains exactly the edges (uij-, Ujj+i), j ^ q — 1, and 
{vij J Vi+l,j) ) i^P !■ 

Lemma 1 ([10]). A minimum feedback vertex set of a pxq mesh Mp^q has size 
at least \{{p — 1) • (g — 1) + l)/3] . 

Definition 2 ([8, 15, 16]). A height- fc complete binary tree Tk = {V, E) is a 
graph composed of 2^+^ — 1 vertices labeled with all different binary strings of 
length 1 < i < k. The root at level 0 is labeled with a distinct value e, the 2* 
vertices at level i, 1 < i <k, are labeled with all possible distinct binary strings 
of length i in lexicographical order. An edge connects any vertex x lying at level i, 

1 <i <k with label (6i, . . . , bi-i,bi) with vertex with label (6i, . . . , bi-i) at level 
i — \ . A two-dimensional m x n mesh of trees, where m and n are powers of 2, 
is a graph MTm,n obtained from an mxn mesh Mm,n by: 1 ) removing all mesh 
edges; 2) building a height-hi, h\ = logn, complete binary tree along each row, 
using the row-vertices as leafs of the tree; 3) building a height-h 2 , /12 = logm, 
complete binary tree along each column, using the column-vertices as leafs of the 
tree. 

In the meshes of trees, the trees along the rows and columns are called row- 
trees and column-trees, respectively. The definition is illustrated in Figure 1 A). 
Two-dimensional meshes of trees are non-regular as the mesh and root vertices 
have degree 2 while all other vertices have degree 3. MTm,n has 3-m-n — m — n 
vertices and 4-m-n — 2-m — 2-n edges. 

In [11] relation (1) was used to provide the following: 

Lemma 2. A minimum feedback vertex set of a two-dimensional mesh of trees 
MTjn,n has size at least m • n/2 — m/2 — n/2 -\- 1. 

Moreover, in [11] Luccio presented a feedback vertex set V of size at most 
^ ■ m ■ n. This bound differs from the lower bound in Lemma 2 for a multi- 
plicative constant 9/8 in the term of highest order. We now improve this upper 
bound obtaining an optimal bound. To do so we propose two new vertex-removal 
strategies and we consider their performance. The simplest is the mesh-removal 
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B) 




A) 




Fig. 1 . A) MT43. The circles give the mesh vertices, the triangles the vertices of 
the row-trees and the squares the vertices of the column-trees. B) Minimum feedback 
vertex set for MTs,s- The removed vertices are colored white. The edges incident upon 
removed vertices are drawn with thin lines. 



strategy, where V consists of all mesh vertices. In the coordinate- axis-removal 
strategy V consists of all vertices of the row-trees at one level above the leafs 
(alternatively one can take all vertices of the column-trees at one level above the 
leafs). For the proof of the following lemma refer to [12]. 

Lemma 3. Applying the mesh-removal strategy to MTm,n, = m-n. Apply- 
ing the coordinate-axis-removal strategy, \V'\ = m ■ n/2. Either strategy gives a 
feedback vertex. 

The mesh-removal strategy removes twice as many vertices as necessary, but 
the coordinate-axis-removal strategy is asymptotically optimal. We now con- 
sider how to further improve it. The set V is constructed along the lines of the 
coordinate-axis-removal strategy with three modifications: 1) None of the ver- 
tices in row 0 is added to V . 2) None of the leftmost vertices at level logn — 1 
of the row-trees (the vertices between column 0 and column 1) is added to V . 
3) All vertices in column 1 at level log m — 1 of the column-tree are added to 
V' . The last two of these modifications were suggested by an anonymous referee 
of Sirocco 2004, which is gratefully acknowledged. The modified construction is 
illustrated in Figure 1 B), and for the proof of the following lemma refer to [12]. 

Theorem 1. „ has a vertex feedback set V with \V'\ = m ■ nj2 — m/2 — 

n/2-bl. 

2.2 Higher-Dimensional Meshes of Trees 

Definition 3. A d-dimensional n\X ■ ■ ■ xud mesh of trees, denoted 
is obtained from a d-dimensional rii x • • • x cube by: 1 ) removing all mesh edges; 
2) building a height-lognj complete binary tree along each of the Y[i<i^j<d''^i 
one- dimensional subarrays of length nj running along coordinate axis j. 
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A d-dimensional ni x • • • x Ud-i x nj, mesh of trees can also be obtained by 
creating copies of d — 1-dimensional rii x • • • x Ud-i meshes of trees and 
connecting these at the bottom level by rii<i<d-i^* complete binary trees of 
height log Ud- For d > 3, the mesh vertices have maximum degree, their degree 
being d. All other vertices have degree 3 except for the tree roots with degree 2. 
In the following we are not interested in exactly matching the lower bounds, so 
there is no need for exact estimates of the number of vertices and edges either. 
For we use the following estimates: 

Lemma 4. 



|y| = (d+l). J] n.-0( ^ n (2) 

l<i<d 

\E\ = 2-d- Y[ n.-0(Y. n (3) 

l<i<d ^^j^d 

Proof. These estimates can be proven using induction from the recursive con- 
struction: The lemma is correct for d = 2. For larger d, the leading constant 
in (3) increases by 2 when going from dimension d — 1, because each of the 
new connecting trees contributes its 2 • — 2 edges. The leading constant in 

(2) increases only by 1, because rid of the 2 • — 1 vertices of the added trees 

are grid vertices, which are identified with the vertices of the other trees. For 
ni = • • • = rid, the results of this lemma can be found in [8]. □ 

Substituting these values in (1) together with r = d — 1, we get: 

Corollary 1. For a d-dimensional ni x ■ ■ ■ x Ud mesh of trees with d > 3, a 
vertex feedback set has size at least rii<i<d’^i ~ ’^i)- 

The two vertex-removal strategies can also be generalized: the mesh-removal 
strategy now amounts to removing all vertices from the d-dimensional ni x • • • x 
Ud mesh, the coordinate-axis-removal strategy amounts to removing all vertices 
along the d — 1 coordinate axes at one level above the leafs. For example, V can 
be built by taking all at level log Uj — 1 from all meshes of trees running along 
coordinate axis j, for all 2 < j < d. Now we obtain the following generalization 
of Lemma 3: 

Theorem 2. For a d-dimensional n\ x ■ ■ ■ x Ud mesh of trees with d > 2, the 
set V constructed with either the mesh-removal or the coordinate-axis-removal 
strategy is a feedback vertex set for With the first strategy \V'\ = 

rii<i<d^o wzt/i the second strategy \V'\ = (d — l)/2 • rii<i<d^*- 

Proof. The correctness is proven as before: removing the mesh results in a set of 
trees whose depth has been reduced by one. Leaving out all at one level above 
the leafs gives us a set of trees of depth two smaller than the original trees plus 
the isolated mesh vertices. Reinserting the vertices along one coordinate axis 
connects the mesh vertices with one tree, not creating any cycles. In total the 
coordinate-axis-removal strategy thus removes d — 1 sets of each (rii<i<d’^i)/2 
vertices. □ 
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Both strategies are correct for all d > 2, but they are not equally good: 
for d = 2, only the coordinate-axis-removal strategy is asymptotically optimal, 
whereas for c? > 4, we must use the mesh removal strategy to achieve this. Only 
for c? = 3 we can choose. 

3 Trees of Meshes 

Definition 4. [9, 13] For a power of two n, the two-dimensional n x n tree of 
meshes, TMn, is an undirected graph obtained by replacing each vertex 

of a complete binary tree with a mesh and each edge by several edges which 
link the meshes together. More precisely, the root is an n x n mesh and its two 
children are n x n/2 meshes, their children are n/2 x n/2 meshes and so on, 
until the leafs are replaced by 1 x 1 meshes. An n' x n' mesh in this tree is 
linked to its two n' x n'/2 children by connecting the vertices in the rightmost 
(leftmost) column to the vertices in the leftmost (rightmost) column in the mesh 
corresponding to the right (left) child. An n! xn! j2 mesh in this tree is linked to its 
two n'/2 X n'/2 children by connecting the vertices in the topmost (bottommost) 
row to the vertices in the bottommost (topmost) row in the mesh corresponding 
to the right (left) child. These connections preserve the column and row order of 
the vertices and the planarity of the graph. 

For TMn is a non-regular graph, with vertices of degree 1, 3 and 4. \Vn\ = 
2 ■ ■ logn -I- and |if„| = 4 • • logn — + n. The definition is illustrated 

in Figure 2. 

Using (1), we get the following lower bound on the number of vertices to 
remove: |U„| > [((4 • • logn — n^ -I- n) — (2 • • logn-h n^) -I- l)/3] = [(2 • n^ • 

(logn — 1) -I- n -I- l)/3] . For n = 2,4, 8, 16, this gives 1, 13, 89, 518, respectively. 

This lower-bound result can be sharpened by separately considering the nu- 
merous meshes embedded in a tree of meshes in combination with Lemma 1. 
Consider TMn with n = 2^. It is composed of four copies of TMn /2 connected 
to an n X (2 • n) mesh. For n = 2^, it turns out that |"(n — 1) • (2 • n — 1) -I- l)/3] = 
(2 • n^ -I- l)/3 — n -I- 1. So we obtain the following recurrence relation: 

Fi| = 0, |F„| >4 - |F„/ 2 | + (2-n"-3-n + 4)/3. 

For n = 2, 4, 8, 16, this gives 2, 16, 100, 556, respectively. This inhomogeneous 
recurrence can be made homogeneous, which results in a characteristic polyno- 
mial of degree 4. The roots can be guessed easily. Using the smallest values, the 
coefficients can be determined. This gives: 

|F„| > 2/3 • • logn - 5/9 • -k n - 4/9. (4) 

An upper bound can be found analogously. Again we use that TMn consists 
of four copies of TMn /2 connected to an n x (2 • n) mesh. The feedback vertex 
set Vf of TMn is recursively constructed as follows: 1) V{ is the empty set. 2) Vf 
is composed of the four subsets which are taken to be identical in each of 
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Fig. 2. A) A two-dimensional TMg. The white vertices give a feedback vertex set of 
size 114, 14 more than the lower bound. B) Feedback vertex set for TMg of size 106, 6 
more than the lower bound. 

the four copies of TM „/2 plus a suitable subset of vertices from the connecting 
nx (2-n) mesh. This mesh subset consists of every third diagonal. These diagonals 
run from upper-right to lower-left. The first is starting in position (0,2). This 
construction is illustrated in Figure 2 A). For its correctness refer to [12]. 

Let us now consider \V^\. (2 • — 2)/3 vertices are added from the nx (2-n) 

mesh. So, we get the following recurrence relation: 

|F/| = 0, K\=A-K^2\ + i^-n^-^)/^- 

For n = 2,4,8, 16, this gives 2, 18, 114, 626, respectively. There are considerable 
deviations from the lower bound, but these are limited to the lower-order terms: 

Lemma 5. The constructed set is a feedback vertex set for TMn- \Vf \ = 
2/3 -n^ •logn-2/9-n2-h2/9. 

For the correctness of this lemma refer to [12]. It is not hard to improve this 
result by giving improved constructions for the smallest TM^. For example, using 
a feedback vertex set for TM 4 with 16 instead of 18 vertices, saves 2 vertices for 
each of the n^/16 disjoint copies of TM 4 in TMn, saving n^/8 vertices in total. 
Such an improved construction for TM 4 can be used in the construction of a 
feedback vertex set for TM„ without creating cycles as shown in Figure 2 B). 
This reduced-size feedback vertex set is denoted Yff . 

Theorem 3. The constructed set Vff is a feedback vertex set for TMn- \Vn \ = 
2/3 • . log n - 25/72 • -b 2/9. 

Comparison with (4) shows that this result is tight to within 5/24 • . Going 

on like this the gap between upper and lower bound can be further reduced. 
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4 Pyramid Networks 

Definition 5. [2, 14] A pyramid network of height n is a graph Pn = {Vn, En) 
with vertex set Vn = {(i,x,y)\0 < i < n,0 < x,y < 2®}, and a set of undirected 
edges given by En = {{{i,x,y), (i,x,y + 1))|1 < i < n,0 < x < 2^,0 < y < 
2* — l}U{((i,a;,j/),(?,a:-|-l,y))|l < i < n,0 < x < 2* — 1,0 < y < 2®}U 
{((i,x,y),(i- 1, \x/2\,\_y/2\))\l < i < < x < 2\{) < y < 2 *}. 

Informally, a pyramid network is composed of a collection of meshes of 
increasing sizes. Level i, 0 < i < n, consist of a 2® x 2® mesh. The 1x1 
mesh at level 0 constitutes the top of the pyramid. In addition to the mesh 
connections, each vertex is connected to one vertex in the level above it and 
four vertices in the level below it. |F„| = “ (4"+^ — l)/3; |if„| = 

Er=i(2* • (2* - 1) + (2* - 1) • 2* -h V) = 4"+i - 2"+2. A pyramid network is a 
non-regular graph with vertices that may have degree 3, 4, 5, 7, 8 and 9. The 
definition is illustrated in Figure 3 A). 




Fig. 3. A) P 2 . The white vertices give a feedback vertex set of size 8, matching the 
lower bound. B) P 3 . The white vertices give feedback vertex sets of size 34, matching 
the lower bound. 



Small Pyramids. Consider the two-level pyramid Pi. Removing only the top 
vertex, the cycle in the 2x2 mesh at level 1 remains. Removing a single vertex 
at level 1, there remains a cycle through the other four vertices. So, at least two 
vertices must be removed from Pi . All cycles in Pi can be eliminated by either 
removing the top vertex and an arbitrary vertex at level 1 or by removing two 
diagonally opposite vertices at level 1. Both constructions are used in Figure 3A). 
Thus, |Fi| = 2. _ _ 

Now consider P 2 . It contains four copies of Pi. Therefore, IR2I > 4- \V i\ = 8. 
As we can see from Figure 3 A) it also suffices to remove 8 vertices from P 2 in 
order to obtain a cycle-free graph. Thus, IR2I = 8. 

Finally consider P3. It contains four copies of P 2 . Therefore, jPal > 4- IR2I = 
32. However, we can prove more, observing that P3 contains in total 17 copies 
of Pi: 16 at the two lowest levels and 1 at the two highest levels. Thus, IR3I > 
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17 • |T^i| = 34. As we can see from Figure 3 B) it also suffices to remove 34 
vertices from P 3 in order to obtain a cycle-free graph. Thus, = 34. 

For larger we do not know the exact value of \Vn\- Particularly, for P 4 
the best lower bound we know is IP 4 I > 4 • jPaj = 136. If there would not have 
been a vertex feedback set V' for P 3 with \V'\ = 34 containing the top of the 
pyramid we could have proven a lower bound of 138, but the right picture in 
Figure 3 B) shows that such a set V exists. A feedback vertex set for P 4 of size 
142 is shown in Figure 4 A). This is the best we were able to construct. 




Fig. 4. A) P 4 . The white vertices give a feedback vertex set of size 142, 6 more than the 
lower bound. Edges incident upon removed vertices are not drawn. B) An extendible 
two-level pattern. 



A general lower bound on |P„| is the following: 

Lemma 6. [11] For a pyramid network Pn of height n, we have |P„| > 2/15 • 

4U+1 _ ^ 

In [11] Luccio presented a feedback vertex set V’ of size at most 7 /45 • 4"+^ — 
2^+'^ [9 + ^112 ■ n + 1. This bound differs from the lower bound in Lemma 6 for 
a multiplicative constant 7/6 in the term of highest order. 

Elementary Patterns. First observe that the following bound trivially holds: 
|Pn| > 4 • At first one might hope that a vertex feedback set for P„ 

can be obtained by taking suitable feedback vertex sets for the four copies of 
Pn-i and removing the top. This would give |P„| < 4 • |Pn-i| -I- 1. However, the 
case n = 3 shows that this does not work. The reason is that when building P„ 
from four copies of Pn-i and a new top vertex, cycles do not only arise by the 
connections added at the top, but particularly also by the connections added 
at the lower levels. To cope with this problem we introduce the notion of an 
extendible pattern (see also Figure 4 B) and Figure 5): 

Definition 6. Let Pk,i, 1 < I < k + 1, be the graph with the structure of the I 
lowest levels of P^. An extendible I -level pattern consists of a vertex feedback set 
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V for Pk,i with the special property that taking 'P , for any j > 0, copies of Pk,i 
with the vertices in V and their incident edges removed, no cycles arise when 
gluing these together in the way they would appear in Pk+j,i ■ 

Lemma 7. There is an extendible two-level pattern removing 32 of the 80 ver- 
tices in Pz, 2 - 

Proof. Figure 4 B) gives the two-level pattern. In this picture 4 copies of the 
pattern for Ps ^2 are glued together. The result is a pattern of connected but cycle- 
free structures spanning three diagonals, separated by diagonals which have been 
entirely removed. From the 64 -|- 16 = 80 vertices of ^3,2, 24 -|- 8 = 32 vertices 
are removed. □ 

Assume that we know an extendible Flevel pattern, then a feedback vertex 
set of Pn can be constructed by, starting from the bottom, alternatively removing 
the vertices in the pattern in I levels, and then removing all vertices in one level. 
More precisely, the levels n -\- 1 — j ■ {I -\- 1), 1 < j < [(n -I- l)/(? + 1)J, are 
removed entirely, while on the intermediate levels the removal is dictated by the 
extendible pattern. Let ¥^ 2 be the feedback vertex set constructed for with 
help of the pattern in Figure 4 B). 

Lemma 8. = 9/63 • 4”+^ -h 0(1). 

Proof. We do no longer consider every individual contribution. Rather we notice 
that from any disjoint copy of P^^s contained in P„, 32-1-4 = 36 out off 80-1-4 = 84 
vertices are removed. Only at the top there may be one partial copy of P3 3. So, 
= 36/84 -|K|± 0(1). ’ □ 

Comparing with the results from Lemma 6, we see that this construction leaves 
a factor 15/14 between upper and lower bound. Further improvements can be 
obtained by using higher-level patterns. The construction of Figure 5 gives the 
following: 

Lemma 9. There is an extendible three-level pattern removing I 4 O of the 336 
vertices in Pa, 3 . 

Let Vf 3 be the feedback vertex set constructed for with help of the pattern 
in Figure 5. The following is proven analogously to Lemma 8: 

Lemma 10. |y/_ 3 | = 12/85 • 4"+^ -k 0(1). 



Best Construction. The given feedback vertex set based on the three-level 
pattern can be further improved by making the construction of the pattern 
explicit and generalizing it for more levels. In this way we obtain a feedback set 
which, without adding new ideas, cannot be further improved. 

Let us consider the extendible two- and three-level patterns as given in Fig- 
ure 4 B) and Figure 5 in more detail. In this discussion the levels will be counted 
from the bottom, so level 0 now refers to the bottom level. In both patterns 6 
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Fig. 5. An extendible three-level pattern. 



out off any 16 level-zero vertices are removed. The remaining vertices together 
with the edges running between them constitute the pattern consisting of shifted 
crosses. In the two-level pattern every second level-one vertex is removed. This 
suffices to eliminate all grid edges at this level and connects the crosses in a di- 
agonal way. In the three-level pattern we could do the same, but then we could 
have left only very few level-two vertices. Therefore we remove some extra level- 
one vertices. By removing 36 out off any 64 level-one vertices, the diagonals are 
cut in pieces which allows to keep every second level-two vertex. These level-two 
vertices connect the diagonal pieces into diagonal bundles. 

This idea can be generalized. If we want to construct an extendible four- 
level pattern, we start with four copies of the three-level pattern. This gives 
the situation in Figure 5. In order to be able to keep a substantial number of 
level-three vertices, we must remove some extra level- two vertices. As in the 
transition from the two-level to the three-level pattern, by removing 36 out off 
any 64 level-two vertices, the diagonals are cut in pieces. Adding every second 
level-three vertex connects these into diagonal bundles again. The lowest four 
levels of vertices in Figure 6 A) illustrate this construction. 

Arguing on, we see that an extendible (/c-|-l)-level pattern can be constructed 
by taking four copies of an extendible fc-level pattern removing an additional 
1/16 of the level-(fc — 1) vertices and every second level-A: vertex. This idea can 
be applied for constructing an even better feedback vertex set for for any 
n > 3: for the lower n — 2 levels we use the extendible (n — 2)-level pattern. 
For the highest three levels we use the pattern depicted in Figure 6 B). Let 
denote the thus constructed set of vertices to remove. 
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Fig. 6 . A) P 5 . The white vertices give a feedback vertex set of size 574, 29 more than 
the lower bound. B) The removal pattern at the top of P„ for n > 3. 



Theorem 4. For all n > 3, is a feedback vertex set for \Vf \ = 9/64 • 

^n+l _ 2 

Proof. At the lower n — 2 levels the extendible pattern connects the vertices into 
eight diagonal bundles spanning 2"“^ — 1 diagonals each. By the construction at 
the top these bundles are interconnected without creating cycles. The top vertex 
is not removed, from level 1 and level 2 half of the vertices are removed. From the 
following levels 9/16 of the vertices are removed, except for level n where only 3/8 
is removed. This gives: \Vf \ — 0-1-I-1/2-4-I-1/2-16-I-9/16-64-I-- • --|-9/16-4”“^-|- 
3/8-4" = 9/16-X;r=o 4”-9/16-1-1/16-4-1/16-16-3/16-4"= 9/64-4”+i-2. 

□ 

Comparing with the results from Lemma 6, we see that this construction leaves 
a factor 135/128 ~ 1.055 between upper and lower bound. 

5 Conclusion 

In this paper we have considered the minimum feedback vertex set problem 
in graphs. We have provided lower and upper bounds for the problem in the 
two- and higher-dimensional mesh of trees networks, in the tree of meshes and 
pyramid networks. Further study is needed to narrow the gap between lower and 
upper bounds for the pyramid networks. A possible approach is to prove a higher 
lower bound for P4, as this would immediately lead to higher lower bounds for 
all Pn with n > 4. Such a proof might be given with help of a refined branch- 
and-bound algorithm. A good branch-and-bound algorithm would probably also 
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allow to reduce the gap between upper and lower bound for the trees of meshes 

to less than e • n^, for any e > 0. 
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Abstract. Load balancing on a multi-processor system consists of re- 
distributing tasks among processors so that all processors end up with 
roughly the same amount of work to perform. The token distribution 
problem is a variant of the load balancing problem where each task has 
unit-size and it represents an atomic element of work. We present an 
algorithm for computing a perfect token distribution (each processor has 
either \T/N^ or \T/N\ tasks, where N is the number of processors and 
T is the number of tasks scattered among processors) on distributed tree- 
connected networks having worst-case running time 0{TD) {D denotes 
the diameter of the tree). The number of token exchanges exceeds the 
optimum by at most 0{D mm{T, N}). 

In order to compute a perfect token distribution each node v must be 
able to store 0{dv{\ogT + log A)) bits, where dv is the degree (number 
of adjacent nodes) of v. This is the first fully decentralized algorithm for 
computing perfect token distributions on arbitrary tree-connected net- 
works which does not receive as input any kind of aggregate information 
about the network (e.g., number of nodes or total number of tokens). 



1 Introduction 

The performance of a distributed network crucially depends on dividing up work 
effectively among its processing elements [8] . This type of load balancing problem 
has been studied in many different models. The basic idea in all of the models is 
to evenly redistribute initial job load among processors (static balancing) and to 
keep load distribution as balanced as possible during time (dynamic balancing). 
Many variants of the load balancing problem have been proposed and widely 
investigated in the literature [1-4,6,9-11]. 

In this paper we consider a basic variant of the static load balancing problem, 
called token distribution problem, restricted to tree-connected networks. A tree- 
connected network is represented by a tree with N nodes. Each node represents 
a processing element of the network and possesses a number of jobs of unit-size 
(tokens) to be processed. The total number of tokens is denoted by T. In a single 
message a token can be moved from any node to any other adjacent node in the 
tree. No token is created or destroyed during the redistribution process. The 
goal is to redistribute tokens across the tree so that each node ends up (being 

* This work was partially supported by the Future & Emerging Technologies unit of 
the European Commission through Project BISON (IST-2001-38923). 
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@ Springer- Verlag Berlin Heidelberg 2004 




222 



Luciano Margara, Alessandro Pistocchi, and Marco Vassura 



aware of that) with either [T/7V] or [T/fVj tokens {perfect token distribution). 
We adopt asynchronous single-port communication model with uni-directional 
communication links. Nodes are anonymous, i.e. they do not have identification 
labels. Synchronization between nodes (executing the same code in parallel) is 
achieved by exchanging messages. 

Previous Results. For general networks Ghosh et al. [4] analyze two algorithms 
which reduce the maximum difference in tokens between any two nodes (called 
"discrepancy") to at most O {{cP log N)/ a), where d is the maximum degree of 
the nodes of the network, N is the number of nodes in the network, and a is 
the edge expansion of the network. Many results have been carried out on spe- 
cific network topologies. For ring networks Gehrke, Plaxton, and Rajaraman [3] 
give an algorithm having an asymptotically optimal message complexity which 
converges to a perfectly balanced state. 

For meshes and torus Houle et al. [7] give an algorithm that reduces the dis- 
crepancy to the minimum degree of the nodes of the network and that runs 
in worst-case optimal time. The same algorithm used for complete binary trees 
obtains in the worst case a discrepancy equal to the height of the tree [6] . 

For arbitrary trees Houle, Symvonis, and Wood [5] give an algorithm for com- 
puting a perfect token distribution assuming that each node at the beginning of 
the computation knows the number of nodes in the tree. 

Our Results. We present a fully decentralized algorithm for tree-connected 
networks which computes a perfect token distribution in time 0{TD) without 
receiving as input any additional (aggregate) information about the network 
such as the total number of nodes or the total number of tokens. The number 
of token exchanges made by our algorithm exceeds the optimum by at most 
0(1? min{T, TV}). In addition, at the end of the redistribution process, all the 
nodes reach a distinguished final state and they are ready to start a new activity. 
Our technique is based on a three phase approach. 

- Phase 1. We compute (in a completely distributed way) the number of nodes 
of the tree and the number of tokens scattered across the tree in 0{D d), where 
d is the maximum degree and D is the diameter of the tree^. 

- Phase 2. Let v be any node of the tree and Tj, 1 < i < k, be the k sub-trees 
rooted at nodes adjacent to v. After Phase 1 v knows the number of nodes and 
the number of tokens contained in each Ti. Taking advantage of this information 
each node having more than [T/N\ tokens is able to decide where to send its 
extra tokens. Nodes that have less than [T/N\ tokens wait until they receive 
enough tokens from their neighbors. After Phase 2 we get an "almost perfect" 
token distribution in which at least N—1 nodes have exactly [T /N\ tokens. Note 
that almost perfect token distributions have discrepancy at most (T mod N). 

- Phase 3. We refine the distribution obtained after Phase 2 in order to make it 
perfect. To this extent, the node containing more than \T/N~\ tokens (at most 
one node has this property) sends them to its neighbors. This procedure is then 
executed recursively by all the neighbors of v. 

^ In multi-port models such information can be computed in 0{D). 
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The rest of the paper is organized as follows. In Section 2 we define some com- 
munication primitives that will be used in our algorithm. In Section 3 we present 
our algorithm while in Section 4 and 5 we sketch the proof of its correctness and 
we discuss its computational cost in terms of time, space, and number of token 
exchanges, respectively. Section 6 contains conclusions and a brief description of 
possible open problems. 

2 Communication Primitives 

We define the following communication primitives: 

-reserve{c) reserves channel c for communication and returns busy if c is already 
reserved; 

-release{c) releases channel c reserved for communication; 

-send(c,m) sends message m through the already reserved channel c; 

-receive{c) receives a message from channel c if a message was sent on channel c 
(channel needs not to be reserved). If no message is arriving from c the message 
returned by receive is NULL. If two or more messages are sent on a channel 
then they are received one by one in the order they were sent. 

Using these primitives we build functions: 

-waitany{c) interrogates every channel until a message is received, then returns 
the message received, saving the channel through which it arrived in the param- 
eter c; 

-wait{c) interrogates channel c until a message is received, then returns the 
message received; 

-safeSend{c,m) waits until c is not busy, reserves it, sends m through c, and 
then releases c; 

-receiveToken{c) receives a token from c, if any. If a message is received updates 
local variables containing the number of tokens; 

-receiveTokensFromAll{) receives, using receiveToken{c), tokens sent from 
neighbors, if any; 

-sendToken(c) sends, using safeSend{c,m) a token from the current node 
through c and updates the local variables containing the number of tokens. 

3 The Algorithm 

Our algorithm consists of three phases: first it collects information about the 
tree structure and the initial token distribution, then uses this information for 
obtaining an almost perfect token distribution, and finally it refines the solution 
by redistributing a small number of tokens. 

In what follows we assume that each node v can store 6>((i^,(log^ -l-logA^)) 
bits of information. 

3.1 Phase 1 

Each node v has an internal state defined according to the values of two local 
variables, namely S and R which contain the number of messages sent and re- 
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ceived by v so far. Let deg be the degree of v. Both S and R are set to 0 at 
the beginning of the computation. According to the values of S and R we define 
5 distinct states: 

S'!.!: [S = 0 and R < (deg -1)]. All the nodes having degree greater than 1 start 
in this state. Nodes in this state are waiting for messages from their neighbors. 
Any node with k neighbors waits for k — l messages and then changes state (from 
S'!.! to 51.2). 

51.2: [S = 0 and R = (deg -1)]. Leaves start in this state. Each node with k 
neighbors reaches this state after receiving k—l messages. A node in this state 
tries to send information about its subtree to the only neighbor from which it did 
not receive any message yet. If succeding the node changes state (from 51.2 to 
51.3). Otherwise, if the channel is busy then the last neighbor is sending subtree 
information. The node receives it and changes state (from 51.2 to 51.4). 

51.3: [S = 1 and R = (deg -1)]. Nodes in this state are waiting for global 
information from the neighbor to which they sent their local information. 

51.4: [S = 0 and R = deg]. Only one node reaches this state: the one computing 
global information. In this state it sends such information to all its neighbors 
and then ends. 

51.5: [S = 1 and R = deg). In this state a node has already received global 
information and forwards it to all its neighbors and then ends. 

In phase 1 each message msg consists of two integer numbers, referred to as 
msg.T and msg.N representing the number of tokens and of nodes of the entire 
subtree rooted at the sender of msg. We assume that channels at each node v 
are numbered from 1 to deg. 



3.2 Phase 2 

In phase 2 we start moving tokens. 

From now on we will refer to the node which received local information from 
all its neighbors as the root of the tree. Every node has two arrays, namely 
subTreeN and subTreeT, containing the total number of nodes and the total 
number of tokens in the subtrees rooted at its neighbors. Each node, knowing 
total number of tokens T and of nodes N in the tree, computes the average 
number of tokens per node Avg = [T/N\. All nodes send or wait for tokens until 
they and the nodes in their subtrees contain exactly Avg tokens, except for the 
root of the tree which might contain extra tokens. Function subTreeAvg{\) is a 
simple function computing the average number of tokens in the subtree rooted 
at Vi', it returns subTreeT[i]/subTreeN[i]. Each node v in Phase 2 has an internal 
state represented by 

— a boolean variable subTreeBalanced {true if all the subtrees rooted at children 
of V are balanced, false otherwise); 

— an integer variable T„ storing the number of tokens at v; 

— an integer variable parent which is set to 0 if u is the root. 
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Phase 1 

INITIALIZATION 

deg dxi 

T-y ■«— tokens at v 

N^, ^ 1 

R ^ 0 

S ^ 0 



parent 0 

subTreeT array of deg NULL elements 
subTreeN array of deg NULL elements 



//neighbors of v 
//tokens in subtree rooted at v 
//nodes in subtree rooted at v 
//messages received by v 
/ /messages sent by v 
//tfie parertt of the node 
//tokens in neighbor subtrees 
//nodes in neighbor subtrees 



//receive local info from all neighbors but one 

//waits for a message from any neighbor 

//tokens in subtree rooted at v 

//nodes in subtree rooted at v 

//tokens in subtree rooted at Vi 

/ /nodes in subtree rooted at Vi 

/ /v received a message 



while(S < deg or R < deg) do 
case (S,R) of 
S = 0. R < (deg — 1) : 

while(R < (deg — 1)) do 
msg-«— it)aiiant/(i) 

Tv ^Tv+msg.T 
Nv ^Nv+msg.N 
subTreeT [i] msg.T 
subTreeN [i] ■«— msg.N 
R^R+1 
endwhile 

S = 0, R = (deg — 1) : 

parent ^ — channel j for which subTreeT [j]=NULL 

//the channel from which v did not received msg 
if (reserve(parent) ^busy) then //check that Vi is not using the channel 

//check if received the message 
/ /message not received 
/ /nodes in subtree rooted at v 
//tokens in subtree rooted at v 
//send values of subtree rooted at v to Vpa 
/ /release channel pa7'ent 
/ /v sent a message 



msg<— rece?'t;e(parent) 

if (msg— NULL) then 
msg.T< — Tv 
msg.N^Nv 
5en<i(parent,msg) 
reiease(parent) 
S^S+1 
else 

reiease(parent) 

Tv ^Tv+msg.T 
Nv ■«— Nv+msg.N 
subTreeTCparent] ^ 
subTreeN [parent] 
parent-*— 0 
R^R+1 
endif 
else 

msg^t<jaif( parent) 

Tv •* — Tv+msg.T 
Nv *— N„+msg.N 
subTreeT [parent] msg.T 
subTreeN [parent] ^msg.N 
parent^ 0 
R^R+1 
endif 

S = l,R=(deg-l): 
msg*— trait (parent) 
subTreeT [parent] ^msg.T— T„ 
subTreeN [parent] *— msg.N — Ny 
Tv *— msg.T 
N„ ■*— msg.N 
R^R+1 



//release channel par'ent 
//tokens in subtree rooted at v 
//nodes in subtree rooted at v 
msg.T //tokens in subtree rooted at Vparent 
msg.N //nodes in subtree rooted at Vparent 
/ /v is root, parent = 0 
//v received a message 



//vparent using the channel : wait for the message 

//tokens in subtree rooted at v 

//nodes in subtree rooted, at v 

//tokens in subtree rooted at Vparent 

//nodes in subtree rooted at Vparent 

/ /v is root, parent = 0 

//v received a message 



//waits for global message 
//tokens in subtree rooted at Vparent 
//nodes in subtree rooted at Vparent 
//total tokens of the tree 
//total nodes of the tree 
/ /v received a message 



5 = 0,R = deg: 
msg.T*— Tv 
msg.N*— Nv 
for I-*— 1 to deg do 
I safeSe.nd(\,rr\s%) 
endfor 
S^deg 

S = 1. R = deg: 
msg.T-*— Tv 
msg.N-*— Nv 
for 1 to deg do 
I if (i 7 ^ parent) then 
I I saf eSend{\, msg) 

I endif 
endfor 
S^deg 
endc8ise 
endwhile 



//total tokens of the tree 
//total nodes of the tree 
//for all neighbors Vi 
//send, global values to Vi 

/ /v sent deg messages 

//total tokens of the tree 
//total nodes of the tree 

/ /has to send global values to all neighbors 

//except Vparent 

//send global values to v-i 



/ /v sent deg — 1 tnessages 
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Phase 2 

INITIALIZATION 

Avg ^ L T„/N„ J 
^ tokens at v 
R ^ 0 

bigger ^ {i|sw 6 TreeAi)( 7 (i) > Avg, i ^ parent} 
smaller ^ {i|su 6 Tree^i; 5 r(i) < Avg, i ^ parent} 
subTreeBalanced-«— (bigger= 0 and smaller= 0) 

while(not((subTreeBalanced=true and T^;=Avg) 

or (subTreeBalanced=true and parent= 0)) do 
case (subTreeBalanced, T^, , parent) of 
subTreeBalanced = false, Tv < Avg, parent any value 

receiveTokensFromAll{) //check for incoming tokens and update data 

bigger ^ {\\subTreeAvg{\) > Avg, i ^ parent} 
subTreeBalanced ^ (bigger= 0 and smaller= 0) 
subTreeBalanced = true, T^, < Avg, parent > 0 

rece 2 L?eTofeen(parent) //all sons have finished, 

//checking tokens coming from parent 
SubTreeBalanced = false, Tv > Avg, parent any value 

receiveTokensFromAll{) //check for incoming tokens and update data 

bigger ^ {\\subTreeAvg{\) > Avg, i ^ parent} 
if (smaller^^: 0) 
j ■«— an element € smaller 

sendToken(j) //send token to Vj and update data 

smaller ^ {\\subTreeAvg{\) < Avg, i ^ parent} 
endif 

subTreeBalanced ■«— {bigger= 0 and smaller= 0) 
subTreeBalanced = true, T^; > Avg, parent > 0 

sendTofcen(parent) //send token to Vparent o.nd update data 

endcase 

endwhile 



S'2.1: [subTreeBalanced = false and T„ < Avg]. Nodes remain in this state 
receiving tokens and updating local variables until they have more than Avg 
tokens and then change state, (from S'2.1 to S2.3). 

S2.2: [subTreeBalanced = true and T„ < Avg, parent > Oj. Nodes in this 
state need not to exchange tokens with children, since all such subtrees have 
Avg number of tokens. They wait for tokens from their parent (a node in this 
state is not the root). 

S2.3: [subTreeBalanced = false and T„ > Avg). Nodes in this state need to 
balance their descending subtrees: they receive tokens from any node and then 
send tokens to subtrees with less than Avg number of tokens. 

S'2.4: [subTreeBalanced = true and T„ > Avg, parent > Oj. Nodes in this state 
have already balanced subtrees rooted at their children and send extra tokens 
towards their parents. 

As we already mentioned, a node stops exchanging tokens if it has balanced its 
subtrees (subTreeBalanced = true and T„ = Avg) (Fig. 1 part (b)) or if it has 
balanced the subtrees of all its children (subTreeBalanced= true) and it is the 
root (parent = 0)) (Fig. 1 part (c)). 

3.3 Phase 3 

In this phase tokens keep on moving until a Finished message is sent. 

In order to distinguish between tokens and finished messages we use two inter- 
nal variables, namely msg. Token and msg. Finished. At any time if msg.Finished= 
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true then msg. Token = NULL, else if msg. Finished = false then msg. Token 
contains a token. 

In phase 3 extra tokens sent to the root during phase 2 are redistributed 
down the tree. The root (and all the other nodes recursively) sends its extra 
tokens to subtrees which can accept them^. In this phase each node can be in 
one of the two following states: 

S'3.1: [T„ > Avg+1 and parent = 0]. Nodes in this state are root of a subtree 
and send their extra tokens to subtrees which can accept them. 

S'3.2: [T„ any value and parent > 0]. Nodes in this state receive messages from 
their parent. If they receive tokens they update local variables. If they receive a 
Finished message they set parent to 0 and become a root. 

When a node is root and has at most Avg+1 number of tokens it sends a Finished 
message to all its children. As proven in Section 4, eventually all nodes become 
a root and receive a Finished message (Fig. 1 part (d)). 



Phase 3 

INITIALIZATION 

p ^ — parent //p is the parent of v 



subTreeT[i] ^ subTreeT[i]+l 
Tv any value, parent > 0 
msg receive(parent) 
if(msgT^NULL) 

if( msg. Finished =true) 

I parent ■«— 0 
else 



i 



//send token to Vi 
//tokens at v 

//tokens in subtree rooted at vi 



//if a message msg received 
//if message received was Finished 
//the parent finished : become root 
//else a token was received 



while(not(Ti, <Avg+l emd parent= 0)) do 
case (Tv » parent) of 
Tv > Avg+1, parent = 0 

i ^ a neighbor for which subTreeAvg(]) < Avg + 1, 
msg. Token -j— a token at v 
msg. Finished ^ false 
safeSend{\, msg) 

- 1 



I Tt, ■« — T^, + 1 

I subTreeT [parent] 
endif 
endif 
endcase 
endwhile 
for i^ 1 to deg do 
I msg. Token ■«— NULL 
I msg. Finished ^ true 

I 

I I safeSend{\, msg) 

I endif 
endfor 



subTreeT [parent] — 1 



//for all neighbors vi 



//except the parent 
//send Finished to Vi 



4 Proof of Correctness 

In this Section we prove the correctness of our algorithm. 

^ Eventually, each node ends up with either Avg or Avg+1 tokens, then each subtree 
must contain less than Avg+1 tokens. 
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Fig. 1. Example of execution of our algorithm. Dashed arrows represent information 
messages, solid arrows represent token exchanges, (a) Phase 1: local information is sent 
from leaves to a single node which becomes the root of the tree, (b) Phase 2: token 
are redistributed in subtrees until each subtree has Avg number of tokens, (c) End of 
Phase 2: each node has Avg tokens but the root that may contain some extra token, 
(d) End of Phase 3: a perfect token distribution is achieved and each node receives a 
Finished message. 



4.1 Phase 1 

Let SI. final be the final state of Phase 1 defined by [S =deg and R =deg]. 

Theorem 1. At the end of phase l,yv G V we have that Ty =T and Ny = N. 
In addition, exactly one node (the root of the tree) has parent = 0 while each 
other node has parent = i where i is the number of the channel which links the 
node to its parent in the rooted tree. 

Sketch of the proof. At the beginning of the computation, each node of degree 
1 is in state 51.2 while all the other nodes are in state 51.1. Then, each node 
in state 51.2 sends its subtree information to nodes in state 51.1 and moves to 
state 51.3. Eventually each node in state 51.1 moves to state 51.2 and decides 
which node is its parent. When state 51.2 contains only two nodes, one of them 
moves to state 51.4. All the other nodes go to state 51.3 and wait for global 
information. The node in state 51.4 is the root, computes global information 
and sends it to its neighbors. Neighbors receive such information, identify the 
node which sent them as they parent, move to state 51.5, and then forward it to 
their children. At the end of this process all the nodes receive global information 
and set T„ = T and N„ = N. Eventually each node moves to state SI. final. □ 
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Fig. 2. State transitions diagram of phase 1. 



4.2 Phase 2 

During Phase 1 we implicitly elect a leader, namely the only node with parent=0. 
We refer to this node as the root of the tree. Links to parent nodes are provided 
by local variables parent. 

Lemma 1. Let Vj be a node different from the root of the tree and such that 
subTreeAvgiJ)^ > Avg. Let v be any node of the subtree rooted at Vj. Then at 
the end of phase 2 v contains exactly Avg tokens. 

Proof. We make the proof by induction on the height of the subtree: 

Base Case. If the subtree has height 0 (only one node) subTreeBalanced = true. 
If the node has Avg tokens it will do nothing in Phase 2 and the lemma is true, 
if it has more it will send tokens to its parent (state 52. 4) until it remains Avg 
tokens. 

Induction. If the subtree has height n + I it has subsubtrees of height 
n. Since the algorithm behaves exactly in the same way for every node in 
the tree except the root we can apply induction hypothesis on them. When 
subTreeBalanced = true the root of the subtree has at least Avg tokens and 
sends extra tokens toward the root of the tree as in the base case (state 52.4) 
and every other node finishes with Avg tokens for induction hypothesis, so the 
lemma holds. If subTreeBalanced = false for induction hypothesis all subsub- 
trees with subTreeAvg > Avg will have Avg tokens in each node and send ex- 
tra tokens to the root of the subtree. The root will receive such tokens (states 
52.1 and 52.3) and, since the total number of tokens of the subtree is at least 
subTreeNp] • Avg, will eventually have at least Avg tokens. If there are subsub- 
Trees with subTreeAvg < Avg their root can not be in state 52.4 and so will 
receive tokens from their parent. Their parent, the root of the subTree, has more 
than Avg tokens and sends some of them to such subsubtrees (state 52.3). Rea- 
soning again on number of tokens the root will make such subsubtrees to have 
subTreeAvg = Avg remaining at least Avg tokens. At this point subTreeBalanced 
will become true and the lemma holds, as explained above. □ 

® Each neighbor of Vj has a different value for subTreeAvgijf) but, being the tree 
rooted, we refer the value owned by the neighbor Vparent which is the parent of vj. 
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Theorem 2. At the end of phase 2 the root of the tree has at least Avg tokens 
while all the other nodes have exactly Avg tokens. 

Proof. Consider subTreeBalanced = true at the root of the tree. Then we apply 
lemma 1 to all the children Vj of the root: all nodes in their subtrees have exactly 
Avg tokens. Then, for definition of Avg, the root has at least Avg tokens and the 
theorem holds. Consider subTreeBalanced = false at the root of the tree. Then 
we apply lemma 1 to all the children vj of the root having subTreeAvgQ) > Avg. 
During phase 2 the root receives tokens (states 52. 1 and 52.3) from subtrees 
having subTreeAvgQ) > Avg. Since the total number of tokens of the tree is 
at least N- Avg, when such subtrees reach balancing the root has at least Avg 
tokens. If there are subTrees having subTreeAvgQ) < Avg the root will have 
more than Avg tokens and will send some of them to such subsubtrees (state 
52.3). The root vj of such subtrees can not be in state 52.4 and then receives 
tokens sent from the root. Reasoning again on number of tokens the root will 
make such subsubtrees to have subTreeAvgQ) = Avg remaining at least Avg 
tokens. At this point subTreeBalanced will become true. We apply lemma 1 to 
all the children Vj of the root and the theorem holds. □ 



4.3 Phase 3 

Theorem 3. After phase 3 each node in the tree has Avg or Avg +1 tokens. 

Proof. We make the proof by induction on the height of the tree: 

Base Case. If the tree has height 0 (only one node) being the total number of 
tokens at most Avg +1 the theorem is trivially true. 

Induction. If the subtree has height n + 1 it has subsubtrees of height n. If 
the root has at most Avg +1 tokens the theorem is proved for theorem 2. If it 
has more tokens it sends them to its sons until it remains Avg +1 tokens, due 
to the total number of tokens this will happen before that all subtrees have 
subTreeAvg — Avg +1. At this time the root will send Finished to all its sons. 
On the other hand each son receives tokens without sending them and sets 
parent to 0 upon receiving the Finished message. Now the algorithm repeats 
recursively on each subtree rooted at each son of the root. Since the average 
number of tokens for each subtree is at most Avg +1 the total number of tokens 
in the subtree is such that its root can send all extra tokens to sons before their 
subtrees have more than Avg +1 average number of tokens. We can then apply 
induction hypothesis to every subtree proving the theorem. □ 



5 Complexity 

We call d the maximum degree of the tree, T the total number of tokens, D the 
diameter of the tree and N the number of nodes. Phase 1 runs in (worst case) 
0{Dd) in single-port model^. Phase 2 runs in (worst case) 0{TD), while phase 3 

0(D) in multi-port model. 
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runs in (worst case) 0{D{T mod N)). Since phase 2 has the greatest complexity 
the overall complexity of the algorithm is 0{TD). 

The space needed to store local information at each node is that for subtrees 
load and number of nodes, plus a constant number of local data of size less 
or equal to the elements of subtrees array, so size complexity at single node is 
6>(d(logT + loglV)). 

In phase 2 all tokens travelling through a link go in the same direction 
and, if Avg = T /N, algorithm achieves perfect token distribution. This means 
that the number of token exchanges is optimum. If Avg < T /N then at most 
T mod N tokens generate extra token exchanges. The longest path done by such 
tokens is from a leaf to the root. So the number of token exchanges is at most 
0{D{T mod N)) greater than optimum. In phase 3, unless extra tokens were al- 
ready at the root, we have already done more token exchanges than the optimum, 
so all the 0{D{T mod N)) worst case token exchanges are beyond the optimum. 
The total number of token exchanges is then no more than 0{D mm{T, N}) 
greater than optimum. 



6 Conclusions and Further Work 

We presented a distributed algorithm which computes a perfect token distribu- 
tion on anonymous tree connected networks without taking as input any global 
information about the tree structure or about the initial distribution of tokens. 

Some other questions remain open and deserve further investigation. Among 
them we wish to point out the following two that are strictly related to the 
results presented in this paper. 

(1) Each node in our algorithm has a local memory of size 0{d{logT + logN)). 
Is it possible to compute perfect token distributions on trees assuming that each 
node has only constant size local storing capacity? 

(2) Is it possible to compute perfect token distribution in a fully decentralized 
way minimizing the number of token exchanges? 
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Abstract. Link-based information structures such as the web can be 
enhanced through the addition of hotlinks. Assume that each node in 
the information structure is associated with a weight representing the 
access frequency of the node by users. In order to access a particular 
node, the user has to follow a path leading to it from the root node. By 
adding new hotlinks to the tree, it may be possible to reduce the access 
cost of the system, namely, the expected number of steps needed to reach 
a leaf from the root, assuming the user can decide which hotlinks to follow 
in each step. The hotlink assignment problem involves finding a set of 
hotlinks maximizing the gain in the expected cost. The paper addresses 
this problem in the more realistic greedy user model recently introduced 
in [3], and presents a polynomial time 2-approximation algorithm for the 
hotlink assignment problem on trees. 



1 Introduction 

1.1 Motivation 

Large databases containing diverse information types are often organized on the 
basis of a hierarchical classification index. The Web, for example, is provided 
with this type of organizational scheme in Yahoo [6] and the Open Directory 
Service [7]. A user who searches for an information item in a hierarchically 
structured database has to follow a path from the root to the desired node in 
the classification tree. The degree of this tree is often relatively low and its 
average depth is often high, causing the search path to be long. Another factor 
contributing to the length of the search path is that the classification usually does 
not take into account the access frequency of the items by users. This implies 
that the depth of certain frequently visited items in the tree may be high. Hence 
the access cost of the tree, defined as the expected number of steps needed to 
reach an item from the root, may be high. 

An ad-hoc solution used in practice in the Web is to augment the tree orga- 
nization with “hotlinks” added to various nodes. These hotlinks lead directly to 
the most frequently accessed items. The hotlinks to be added should be selected 
based on the access probability of the various items in the database. 
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When searching for an item in the original tree structure, the user starts from 
the root and advances along tree edges along the unique path to the desired 
destination leaf. An implicit assumption underlying the common hierarchical 
approach is that at any node along the search in the tree, the user is able to 
select the correct link leading towards the desired leaf. Evidently, the user does 
not necessarily know the tree topology. However, the user usually has some 
general knowledge about the domain, and the links followed at any node are 
selected on the basis of the natural partitioning of the domain. Thus, when the 
user sees several tree edges and hotlinks at a node, the user is capable of selecting 
the right link downwards in the tree. 

With hotlinks added, the situation is more involved, since the resulting 
hotlink-enhanced index structure now becomes a directed acyclic graph (DAG), 
with possibly more than one path to some destinations. Again, it is assumed 
that when faced with a hotlink in the current page, the user will be able to tell 
whether or not this hotlink may lead it to a closer point on the path to the 
desired destination. However, the hotlink selection process is not known to the 
user. Subsequently, while at a particular node, the user knows only the hotlinks 
emanating from that node (or possibly also hotlinks emanating from nodes previ- 
ously visited by it on the way from the root to the current node). In particular, 
the user cannot know whether any hotlinks emanate from descendants of the 
current node, or where such hotlinks may lead. 

This implies that any approach taken for designing a hotlink-enhanced index 
structure must take into account certain assumptions regarding the user’s search 
policy. Two models have been considered concerning user capabilities. One natu- 
ral model, referred to as the “clairvoyant” user model, captures situations where 
the user somehow knows the entire topology of the enhanced structure. This 
model is based on the following assumption. 

The clairvoyant user model: At each node in the enhanced structure, the 

user can infer from the link labels which of the tree edges or hotlinks available at 
the current page is on a shortest path (in the enhanced structure) to the desired 
destination. The user always chooses that link. 

This model appears to be too strong to be considered realistic. An alternative 
model recently proposed in [3] is based on the assumption that the user does 
not have this knowledge. This forces the user to deploy a greedy strategy. 

The greedy user model: At each node in the enhanced structure, the user 

can infer from the link labels which of the tree edges or hotlinks available at the 
current page leads to a page that is closest in the original tree structure to the 
desired destination. The user always chooses that link. 

In this paper we address the optimization problem of constructing a set of 
hotlinks that achieves a maximum improvement in the access cost. The problem 
has been given an exact algorithm on trees in the greedy user model in [3], but 
the complexity of that algorithm might be exponential in the depth of the tree, 
hence it yields a polynomial time solution only for trees of logarithmic depth. 
In the clairvoyant user model, the problem has been given a polynomial time 
2-approximation algorithm on arbitrary trees [4]. Unfortunately, this algorithm 
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does not appear to extend to the more realistic greedy user model. Our goal 
here is thus to develop an approximation algorithm for the problem on arbitrary 
trees in the greedy model. 

1.2 Formal Definitions 

Formally, given a rooted directed tree T with root r storing the database, a 
hotlink is a new directed edge which is not part of the tree. The hotlink starts at 
some node v and ends at (or leads to) one of its descendants u. To each node v we 
may assign at most one hotlink. Each node x of T has a weight p{x), representing 
its access frequency, namely, the fraction of the user visits to that node, compared 
with the total number of user visits to tree nodes. Normalizing the weights allows 
us to interpret p{x) as the probability that a user wants to access node x. For 
simplicity, we assume that for each nonleaf node a;, p{x) = 0. Denote by r{x) 
all X immediate descendants in the tree, and define P{x) = Yl,y^r(x) where 
a leaf x, will have value P{x) = p{x). 

Let S' be a set of hotlinks constructed on the tree T (obeying the bound of K 
outgoing hotlinks per node). Denote by T 0 S the enhanced structure obtained 
by augmenting the tree T with the hotlinks of S. Denote the length of a path L 
(in edges) by \L\. 

In the greedy model, let Dg(T, v) and Dq{T 0 S, v) denote, respectively, the 
greedy path from the root to the node u in T (i.e., without using the hotlinks) 
and the greedy path to u in T 0 S (i.e., including the hotlinks). The gain from 
a hotlink assignment S is defined as 

9g{T,p,S) = ^ ^|L>g(T,u)| - |Dg(T0 S,u)|Vp(u) . 

vGLeaves{T) 

Two optimization variants can be considered. The hotlink enhancement problem 
requires finding a set of hotlinks S optimizing the expected cost. The hotlink as- 
signment problem requires finding a set of hotlinks S which optimizes gc{T,p, S) 
and achieves the optimal gain, namely, 

9g{T,p) = max{5G(T,P,S)}. 

A set S attaining this optimum is termed an optimal set of hotlinks. 

In the clairvoyant model, denote by Dc(T, v) and Dc{T 0 S', u), respectively, 
the clairvoyant (shortest) path that does not include hotlinks and the clairvoyant 
path including hotlinks from the root to node v. Subsequently, define the gain 
gc{T,p, S) from a hotlink assignment S and the optimal gain g’^{T,p) as in the 
greedy model, except for replacing the subscript G by C. 

Note that while the optimization problems are equivalent (in the sense that 
a set of hotlinks S is an optimal solution to one if and only if it is an optimal 
solution to the other), the corresponding approximation versions may be very 
different, namely, the same set of hotlinks S may well-approximate one but not 
the other. 
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1.3 Related Work 

Most past discussions on the problem concentrated on trees and DAG’s. The 
NP-completeness of the hotlink enhancement problem on DAGS is proven in [1] 
by a reduction from the problem of Exact Gover by 3-Sets. (As this problem 
is equivalent to the hotlink assignment problem, the latter is NP-complete as 
well.) That article also discusses several distribution functions on the leaves, 
including the uniform, geometric and Zipf distributions, but restricts the dis- 
cussion to full binary trees. An interesting analogy is presented therein be- 
tween the hotlink enhancement problem on trees and coding theory. A clas- 
sification tree can be interpreted as a coding of words (associating a move down 
to the tth child with the letter ’i’). Under this interpretation, every leaf cor- 
responds to a codeword. The addition of a hotlink adds a letter to the alpha- 
bet. This provides a lower bound for the problem based on Shannon’s theorem. 
By this lower bound, denoting the entropy of the probability distribution p 
by H(p) and the maximal degree of the tree by A, the expected access cost, 

cost(TeS,p) = ElEveLeavesiT) | T>c (T 0 S', u) | •p(u)], is at least [1,2]. 

Based on these bounds, an approximation algorithm for the hotlink enhance- 
ment problem on bounded degree trees in the clairvoyant user model is presented 
in [2]. This algorithm approximates the expected access cost, cost(T(BS,p). The 
access cost guaranteed by this algorithm is no more than 
+ hence the approximation ratio achieved by the algorithm is 

+ iog(zA-n)-(i1og^i)/(z^-H) ' ^’^ich is in general at least log(Z\ 0 1). 

Another recent article [5] discusses an interesting application of hotlink as- 
signments in asymmetric communication protocols for achieving better perfor- 
mance bounds. 

The greedy user model was presented in [3]. The underlying assumption is 
that the user has limited a-priori knowledge regarding the structure of the clas- 
sification tree. The paper proposes an exact algorithm for solving the hotlink 
enhancement problem in that model on trees. The time complexity of that algo- 
rithm is polynomial over trees of logarithmic depth. The solution is also general- 
ized to situations where more than one hotlink per node is allowed. For the case 
in which the distribution on the leaves is unknown, the paper gives an algorithm 
guaranteeing (an optimal) logarithmic upper bound on the access cost. 

The hotlink assignment problem (namely, optimizing the gain) is introduced 
in [4]. The paper presents a polynomial time approximation algorithm for the 
hotlink assignment problem in the clairvoyant model. The algorithm achieves a 
worst case approximation ratio of 2, and works on general graphs. Unfortunately, 
this algorithm does not apply in the greedy user model, and it is not clear how 
to extend it to that model. Note that the algorithms of [2] and [4] cannot be 
directly compared as the approximation applies to different measures. 

1.4 Our Results 

In this paper we present a new polynomial time approximation algorithm for the 
hotlink assignment problem on rooted trees. The algorithm applies to both the 
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clairvoyant model and the greedy model, and achieves an approximation ratio 
of 2. The algorithm is based on restricting attention to hotlinks of length 2, 
namely, hotlinks from a node to one of its grandchildren, and finding the best 
length two hotlink assignment for the given tree structure. The approximation 
is based on the crucial observation that the best length two hotlink assignment 
for a given tree T yields at least half the gain of the optimal (unrestricted) 
hotlink assignment for T. Moreover, we use this fact to show that the solutions 
generated by our algorithm achieve an approximation ratio of 2 in both the 
clairvoyant model and the greedy model, implying that the ratio between the 
optimal solutions in the two models is at most 2. 

2 Basic Properties and Preliminaries 

Denote by T„ the subtree of T rooted at node v. Define the height of a tree T, 
denoted height{T), as the length of the longest path from the root of T to some 
leaf u. For each node v denote its set of children by C{v). 

Since any path from the root to a node u in the greedy model cannot be 
shorter than the shortest possible path to u, which is chosen in the clairvoyant 
model, we have: 

Lemma 1. For any tree T, and for any assignment S of hotlinks, 

9 g{S,T,p) < gc{S,T,p) . 

Let us represent a hotlink from node u to node u as a pair of parentheses, with 
u marking the left parenthesis and v marking the right one. The hotlinks placed 
along any path from the root can now be viewed as a sequence of parentheses. 
We say that a set of hotlinks S is well-formed if for any path P from the root 
to a leaf over the tree, the parentheses sequence of S on this path is balanced, 
namely, hotlinks do not cross each other. 

Lemma 2. For a well-formed hotlink assignment S, the gains in the greedy user 
model and in the clairvoyant model are the same, i.e., gG{S,T,p) = gc{S,T,p). 

Proof. For a well-formed set of hotlinks S forming a hotlink enhanced index 
structure T 0 S', the greedy path from the root to any leaf v coincides with the 
shortest path, i.e., Dq{T ® S,v) = Dc{T 0 S,v). □ 

Based on these observations, the following lemma is proved in [3]. 

Lemma 3. [3] For every index tree T, there exists an optimal solution to the 
hotlink assignment problem in the greedy user model which is well-formed. 

The length of a hotlink is defined as the number of edges in the path connect- 
ing its end points in the tree. A set of hotlinks S is called a length two assignment 
if all of its hotlinks are of length 2. Denote by Length two Flotlink Assignment 
(L2FIA) the variant of the hotlink assignment problem in which it is required 
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to find the set of hotlinks S which optimizes gc(T,p, S) under the constraint 
that the hotlink set S is a length two assignment. Denote this optimal gain by 
gQ^^{T,p) = u\&yis{gc{T,p,S) | S' is a length two assignment}. 

In the same manner, define LkHA as the variant of the hotlink assignment 
problem restricted to hotlinks of length exactly k. 

Lemma 4. For every index tree T , there exists an optimal solution to the L 2 FIA 
problem in the clairvoyant user model which is well-formed. Furthermore, any 
length two hotlink assignment that is not well-formed can be manipulated into 
well-formed assignment with the same clairvoyant model gain. 

Proof. Consider a length two hotlink assignment S which is not well-formed. 
Since the length of the hotlinks is bounded by 2, the only possibility is that 
there exists a path of four nodes vi,V2,vs, V4 such that Vi+\ is the child of Vi and 
that vi has a hotlink to V3 and V2 has a hotlink to V4 (see Figure 1). In such a 
case suppose that several hotlinks were added that changed the depth of vi and 
V2 , and that their depth after the hotlink assignment is di and ^2 respectively. If 
di > c?2 then the hotlink from vi to V3 can be removed since the path from vi to 
V3 can be replaced by travelling to V2 and then directly to V3 by the connecting 
edge. In case di < ^2, the hotlink from V2 to V4 can be removed and V4 can be 
reached through V3. In either case, the assignment has lower cost and one less 
violation of the well- formed requirement. □ 




Fig. 1. An example for assignment which is not well- formed in L 2 HA. 



Corollary 1. To solve L 2 FIA it suffices to consider only well- formed length two 
assignments. 

Lemma 5. For any k > 2 , to solve LkHA it suffices to consider only well- 
formed length k assignments. 

Proof. Assume, to the contrary, that in any optimal solution there are crossing 
hotlinks. Observe the first two hotlinks that cross each other (the highest in the 
tree). The first starts at v and ends k nodes lower, the second starts at node u 
which is d nodes lower than v and ends in k nodes lower than u at node w. Then, 
since these are the highest crossing hotlinks there are no incoming hotlinks to 
the nodes between v and u and thus travelling to w through the second hotlink 
takes at least d -I- 1 steps, plus the number of steps it takes to travel to v. By 
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removing this second hotlink the distance to w does not increase since we can 
travel to v plus take the first hotlink and then travel d steps down to w, and 
that will also be c? + 1 steps. □ 

Lemma 6. For a set of well-formed length two hotlinks S = Si[J S 2 , where S\ 
and S 2 are disjoint, the gain from S is the sum of the gains from Si and S 2 , 
namely, 

gc{T,p,S) = gc{T,p,Si) + gc{T,p,S 2 ) , 

9g{T,p,S) = gG(T,p,Si) gG{T,p,S2) ■ 

Proof. Consider the greedy (or clairvoyant) path to some node u in T0 S'!. It 
includes some subset 5“ of the hotlinks from ^i. Likewise, the greedy path to u 
in T 0 S '2 includes some subset Sf of the hotlinks from S 2 ■ Since S' = IJ S '2 

is well-formed, no two hotlinks cross each other, and since it is a length two 

assignment they cannot overlap each other. Hence in an assignment containing 
all the hotlinks of S, a user travelling from the root r to u can take both sets of 
hotlinks S“ and Sf, and hence the gain achieved is the sum of the two. □ 

Corollary 2. For a set S of well-formed length two hotlinks, 

9c(T,p,S) = ^gG{T,p,s) . 
ses 




100 * ^20 



Fig. 2. An example tree T. 



This corollary does not hold in case of non-well-formed assignments (even in 
length two assignments). For example, in Figure 2, the hotlink from V 2 has gain 
100 • 1, and the hotlink from vi has gain 100 • 1 0 20 • 1 = 120. The sum of the 
gains of these two hotlinks is thus 220, but when assigning both of them to T, 
the gain is only 100 • 1 0 20 • 1 = 120. A better choice would be to assign a hotlink 
from v\ to w with separate gain 30 • 1 = 30 and a hotlink from V 2 as before. The 
sum of gains for the two hotlinks is 100 0 30 = 130 < 220 but the actual gain 
achieved by adding both of them to the tree is 130, which is higher. 
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3 A Polynomial Time Algorithm for L2HA 

The number of possible length two assignments for a single node is bounded by 
where A is the maximum degree of the tree T. Therefore, based on Corollary 
2, a dynamic programming algorithm can be devised to solve the problem. The 
algorithm is based on filling a table M of length n with a value for every node 
and a,n X A table M' with a value for every node-child pair. The value of M{y) 
corresponds to the maximum gain possible in the subtree while M'{v,u) for 
a vertex v and its child u is the greatest gain possible in the subtree Ty without 
hotlinks directed from v to one of u’s descendants. 

The calculations proceed from the leaves up. The base of the recursion is 
when the tree rooted at v has height 2. In such a case the best possible length 
two hotlink assignment is to the heaviest grandchild of v and the value of M{v) 
is set to be the weight of that grandchild. The value of M'(v,u) is set to be the 
maximum weight grandchild of v which is not a descendant of u. If the tree is of 
height{Ty) < 2 then the value of both M{v) and M'{y,u) is set to 0. 

M{y) is calculated by breaking the problem to subproblems. For every child 
u of V, calculate the gain if the hotlink would be directed to one of u’s children. 
This gain is the sum of all gains from the other children of v plus the gain from a 
hotlink directed to some child w oiu (which is 1 • P{w)). In addition we need to 
take into account the possible gain from hotlinks directed from u, but restricting 
u from choosing any hotlink to w’s descendants. The grandchild w is chosen from 
all children of u to maximize the gain of the assignment. 

M (v, u) is also calculated similarly. Formally, the values of M (v) and M'(y, u) 
are defined as follows (see Figure 3). 




Fig. 3. Calculating M{v) and M'{v,u). 



M{v) = jnax 

uGr{v) 





u^r{v),u^u 
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The algorithm generates all possible length two we 11- formed assignments. 
Hence by Corollary 1 the optimal gain for L2HA is achieved. The time bound is 
the number of table entries multiplied by the time to calculate each entry, which 
is polynomial. 

4 A 2 Approximation for Hotlink Assignment 

4.1 The Approximation 

We now prove that in the clairvoyant model the optimal solution for L2HA yields 
a 2 approximation for the hotlink assignment problem, namely, 2 • g'^ (T,p) > 
9c(T,p)- 

Intuitively, every hotlink that shortens k steps to a certain leaf can be broken 
into a chain of [(fc-h 1)/2J hotlinks of length 2, henceforth referred to as a worm 
of hotlinks, and the total gain achieved will be at least k/2. For example, in 
Figure 4(a) the direct hotlink from r has a gain of 5 • 100 = 500, and the worm 
gain is 3 • 100 = 300. in Figure 4(b) the direct hotlink from r has a gain of 
4 • 100 = 400, and the worm gain is 2 • 100 = 200. 

r 



A-- ' 100 

100 

(a) (b) 

Fig. 4. A worm example. 
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Lemma 7. For every tree T, the gain of an optimal solution to the L2FIA prob- 
lem is at least half of the optimal gain using arbitrary length hotlinks in the 
clairvoyant user model, namely, 

2-g*c,,{T,p) > g*c{T,p) . 

Proof. By induction on the tree height h. (For simplicity the proof will be pre- 
sented for binary trees, but it applies to all trees.) 

For a tree of height 2 with root v, the value of M{v) is the weight of the 
heaviest grandchild. This gain is achieved by choosing a hotlink from v to that 
leaf, which is also the best possible choice of hotlink assignment in the clairvoyant 
model, and thus the gain of the hotlink assignment is the same. In particular, 
2-g*c^^{T,p)>g}.{T,p). 

Assume the claim is correct for any tree of height at most h, and consider 
a tree of height h 1. Consider the optimal assignment of an arbitrary length 
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hotlink from the root v to some node w. Without loss of generality, this hotlink 
leads to the left subtree, Ty^. By the inductive hypothesis, the solution for the 
right subtree Ty^ can be replaced by an L 2 HA assignment with at least half the 
gain of the optimal assignment on that subtree, i.e., G*Q^^{Ty^) > 

The subtree is also of height at most h, hence its entire hotlink assign- 
ment, Sha, can be replaced by a length two hotlink assignment Sl2HA, still 
attaining at least half the gain. 

The only remaining modification required is to discard the hotlink from v 
while still retaining at least half of the gain achieved by this hotlink, without 
decreasing the gain of the other length two hotlinks assignments. This is done 
by creating a worm W{v,w) from v to w. This worm may ’’collide” with other 
hotlinks that were already assigned by Sl2HA, in the sense that it may require 
setting a hotlink from some node u along its path, where such a hotlink has 
already been selected. When such a case occurs, we remove the previous hotlink 
from Sl2HA and add the one required by the worm. Denote this assignment by 
Sl2HA„ ■ It is clear that such a worm achieves half the gain of a direct hotlink 
from V to w. Hence to complete the proof, it remains to show that the worm 
W{u,w) does not decrease the gain achieved by the set of hotlinks Sl2HA- 

Consider the structure of the worm W{y,w). It has a length two hotlink 
emanating from every second node on the path from v to w. Denote the depth 
of nodes on the path as their distance from the node v (see Figure 5). The nodes 
that are assigned hotlinks by the worm are all even depth nodes (see Figure 5), 
so the only possible decrease in the gain achieved can be caused by previous 
hotlinks from Sl2HA that were removed due to the addition of the worm. The 
gain achieved by any assignment is the summation over all the nodes, of the 
shortening of nodes cost multiplied by its relative weight. Therefore, if for every 
node u the shortening achieved by Sl2HA is achieved also after adding the worm 
then the gain remains the same. 




depth=2d-2 

depth=2d 



(a) 



(b) 



Fig. 5. The two possibilities for node u. 
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Consider a node u in and let y be the lowest common ancestor of u 
and w (see Figure 5). Observe that at most one length two hotlink on the path 
leading from v to u could have been deleted by the worm. This hotlink must 
have originated at some even depth node, specifically, either y or its parent (see 
cases (a) and (5) in Figure 5), and aimed at some grandchild node x which is u’s 
ancestor and is not on the path to w (specifically either j/’s child or grandchild on 
the path to u). From this node x and onward the path to u remains untouched. 
Thus we must show that the travelling path to x is of same length as in the 
original Sl 2 HA assignment. 

Suppose X is of depth 2d from v (as explained it must be an even depth). In 
the Sl 2 HA assignment, the path to x first travels from v to vl and then follows 
the shortest path to u, consisting of tree edges and possibly some hotlinks. Since 
X is at depth 2d, the optimal choice of hotlinks for a: is a direct worm W {vl, x) 
of length d — 1 (since the length of the path from vl to a: is 2d — 1 the worm will 
end at the parent of a:) and the total minimal length of the path from vl to x 
will be (d — 1) + 1 and from u to a: a total of \Dc{T 0 Sl 2 HA, a;)| > d 0 1 steps 
to reach x. In the Sl 2 HA„, assignment the hotlink to x was removed, but there 
was a direct worm from v up until the last hotlink. The total length of the new 
path to X is thus \Dc{T © Sl 2 HA„,,x)\ < 0 2 = d 0 1. □ 

Theorem 1. The hotlink assignment problem has a 2 approximation algorithm 
both in the clairvoyant and the greedy model, namely, 2 ■ gQ^^{T,p) > gQ(T,p). 

Proof. The L2HA optimal solution, i® well-formed and thus by Lemma 

2 its value is the same under the greedy model. Namely, ga{Sf 2 HA^'^’P) ~ 
9ci.Sl2HA^T,p). Hence 

2 • 9g{Sl2HAj'^^P) = 2 • 9c{Sl2HAj"^jP) 

= 2-g*c,,{T,p) > g*c{T,p) > gUT,p) , 

hence Sf 2 HA i® ^ solution for the greedy model with approximation ratio 2. □ 

We conclude with a bound on the gap between the clairvoyant and greedy mod- 
els. 

Corollary 3. For any tree T, g*G{T,p) < gc{T,p) < 2 • g*G{T,p). 

Proof. By the theorem, there exists a well-formed solution Sf 2 HA using length 
two hotlinks for the tree T, satisfying gG{T,p) > gG{Sf 2 HAi"^^P) — 9c('^’P)/‘^- 
Recalling also Fact 1, the claim follows. □ 



4.2 Generalization to K Hotlinks in Each Node 

The same ratio of 2 can be proven when K hotlinks are allowed per node. In 
fact, when adding the K worms from v to wi<i<K, we can remove all K hotlinks 
previously assigned to a node by Sl 2 HA when collision occurs, since the argument 
above will apply to all of the hotlinks destinations. Hence we are left with K — 1 
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additional hotlinks we can now assign to reduce farther the cost (in some cases it 
will be possible to use it in others not). But it seems that generally the value of 
L2HA relative to the general hotlink assignment problem changes for the better 
as K increases. 

4.3 Example of Tightness 

There are instances where the best gain achieved by length two hotlinks is exactly 
half of that possible by general hotlinks. An example is given in Figure 6. For 
this example, the optimal length two hotlink assignment shown in Figure 6(a) 
yields gain 10 + 2 • 20 = 50, and the optimal arbitrary length hotlink assignment 
shown in Figure 6(b) yields gain 2 • 10 + 4 • 20 = 100. 





Fig. 6. Tight example. 
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Abstract. A Boolean value of given a priori probability distribution 
is transmitted to a deciding agent by several processes. Each process 
fails independently with given probability, and faulty processes behave 
in a Byzantine way. A deciding agent has to make a decision concerning 
the transmitted value on the basis of messages obtained by processes. 
We construct a deterministic decision strategy which has the provably 
highest probability of correctness. It computes the decision in time linear 
in the number of processes. 

Decision optimality may be alternatively approached from a local, rather 
than global, point of view. Instead of maximizing the total probability 
of correctness of a decision strategy, we may try to hnd, for every set of 
values conveyed by processes, the conditionally most probable original 
value that could yield this set. We call such a strategy locally optimal, as 
it locally optimizes the probability of a decision, given a set of relayed 
values, disregarding the impact of such a choice on the overall probabil- 
ity of correctness. We construct a locally optimal decision strategy which 
again computes the decision value in time linear in the number of pro- 
cesses. We establish the surprising fact that, in general, local probability 
maximization may lead to a decision strategy which does not have the 
highest probability of correctness. However, if the probability distribu- 
tion of the Boolean value to be conveyed is uniform, and all processes 
have the same failure probability smaller than 1/2, this anomaly does 
not occur. 



1 Introduction 

In faulty environments, decisions must often be made on the basis of erroneous 
or corrupted data. This is the situation, e.g., in system-level diagnosis [10] where 
processors have to be diagnosed on the basis of tests conducted by other pro- 
cessors, and faulty testers are unreliable. In network communication, messages 
travel to their destinations by multiple paths, some of which contain faulty com- 
ponents and may relay corrupted information. In nuclear reactors, temperature 
of the core is monitored by multiple sensors, some of which may be dysfunc- 
tional. In all these situations it is desirable to achieve fault tolerance, i.e., the 
ability to make correct decisions with high probability, in spite of faults occur- 
ring in the system. The fundamental mechanism used to achieve this capacity is 
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based on redundancy. The same information (diagnostic decision, message in a 
network, temperature value) is conveyed by multiple relayers (processors, com- 
munication paths, sensors). A deciding agent makes a decision concerning the 
transmitted information based on data obtained through these relayers, using 
some kind of voting mechanism which maps each set of obtained values to a 
single decision. It is important to design strategies of the deciding agent that 
have a high probability of correctness. 

In this paper we consider a very general model of a fault-prone system in 
which a decision has to be made on the basis of unreliable information. We as- 
sume that a Boolean value is conveyed to the deciding agent by several processes. 
The a priori probability distribution of this value can be arbitrary and is known 
to the agent. If the agent does not have any information on the distribution, it 
may assume that it is uniform. Relaying processes (which model processors in 
a system, communication paths in a network, sensors in a monitoring system) 
are assumed to fail independently (and independently of the stochastic choice of 
the value to be conveyed), with known (but arbitrary and not necessarily equal) 
probabilities. Fault-free processes relay the correct value but faulty ones may be- 
have arbitrarily, in a Byzantine way (they may relay the value correctly, corrupt 
it, or relay no value at all). The deciding agent receives the vector of relayed 
values and must make a decision concerning the original value. The aim is to de- 
sign a deterministic decision strategy which has the highest possible probability 
of correctness. 

1.1 Our Results 

We first define precisely the probability of correctness of a (deterministic) de- 
cision strategy. This is not an obvious notion, due to the Byzantine behaviour 
of processes: our model involves a mixture of stochastic and adversarial ingre- 
dients (on the one hand random distributions of the decision values and faulty 
processes, and on the other hand the Byzantine nature of faults) and both these 
ingredients have to be accounted for in the definition of correctness probability. 

Given this definition we design a decision strategy and prove that it has 
the highest possible probability of correctness among all (deterministic) decision 
strategies. This strategy is simple to implement and the decision value is com- 
puted in time linear in the number of processes. In fact, it is computed in time 
linear in the number of processes whose failure probability is less than 1 /2, if such 
processors exist, and in constant time if they do not exist. Indeed, it turns out 
that the optimal decision strategy is based only on data conveyed by processes 
whose failure probability is less than 1/2, if such processes exist. Otherwise, the 
a priori more probable Boolean value is chosen. 

Decision optimality may be alternatively approached from a local, rather than 
global, point of view. Instead of maximizing the total probability of correctness 
of a decision strategy, we may try to find, for every set of values conveyed by 
processes, the conditionally most probable original value that could yield this set. 
We call such a strategy locally optimal, as it locally optimizes the probability 
of a decision, given a set of relayed values, disregarding the impact of such a 
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choice on the overall probability of correctness. We construct a locally optimal 
decision strategy which again computes the decision value in time linear in the 
number of processes. It is natural to ask if these two ways of optimizing the 
decision are equivalent. The surprising answer is no. We show that, in general, 
local probability maximization may lead to a decision strategy which has not 
the highest probability of correctness. However, if the probability distribution of 
the Boolean value to be conveyed is uniform, and all processes have the same 
failure probability smaller than 1/2, this anomaly does not occur. 



1.2 Related Work 

Voting mechanisms have been proposed as a fundamental way to achieve fault 
tolerance. Coteries, defined in [7] and further studied, e.g., in [1,2,6,8,11], are 
considered to provide the most general framework for such schemes. An impor- 
tant way of using voting mechanisms, considered also in the present paper, is to 
produce a single value on the basis of many unreliable reported values. A similar 
approach was adopted in [3,4], however, unlike in our case, the authors consid- 
ered a purely stochastic model in which not only the a priori distribution of the 
transmitted value and distribution of faulty processes but also reports of faulty 
processes (and even - in one of the models - reports of fault-free processes) are 
random. This purely stochastic model facilitates the definition of correctness of 
a decision strategy but has limited applicability, as the probability distribution 
of faulty process reports is often unknown, and in some cases this behaviour may 
not be random but controlled by a malicious adversary. Our approach of treating 
faulty processes as Byzantine makes it possible to deal with this very general 
kind of faults. Moreover, in the purely stochastic model used in [3,4], computing 
the value of the optimal decision strategy given a set of data conveyed by pro- 
cesses was a task taking time exponential in the number of processes, and hence 
intractable for larger systems. This should be contrasted with our framework 
which, in addition to treating a more general type of faults, permits to compute 
the decision value in time linear in the number of processes. 

Decisions based on data subject to stochastically distributed faults of Byzan- 
tine type have been also considered in a different framework, that of system-level 
diagnosis of multiprocessor systems (cf. [10]). In particular, in [5, 9], the authors 
considered the problem of finding a diagnosis scheme having the highest proba- 
bility of correctness, when faulty processors are distributed randomly but behave 
in a Byzantine way when testing other processors. While the setting in both cases 
is different, some of the definitions and techniques have the same spirit. 

2 Terminology, Definitions and Preliminary Results 

Let b denote the Boolean value to be conveyed. Assume that the a priori proba- 
bility of 0 is p and of 1 is 1 — p. Without loss of generality assume that p > 1 — p 
(the other case is symmetric). For any v G {0, 1}, we let q{v) = p if u = 0, 
and q{v) = 1 — p otherwise. Let {1, ..., n} denote the set of processes and let pi 
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denote the failure probability of process i. We assume that processes fail inde- 
pendently. A set of values conveyed by processes, called a report, is any function 
R : {1, ...,n} — > {0, 1}. R{i) denotes the value reported by process i. Fault-free 
processes report the correct value b, and faulty processes may report any value 
or no value at all. If process i does not report any value (this may be the case 
only for a faulty process), we set R{i) = 0 by default. The set of all reports is 
denoted by TZ. 

We consider only deterministic strategies, although we evaluate them proba- 
bilistically, due to the stochastic ingredient in the environment. Hence a decision 
strategy is a function assigning a Boolean decision to any report. More precisely, 
a decision strategy is any function D : TZ — > {0,1}. We now formally define the 
probability of correctness of any decision strategy. The sample space is the set 
Q = {(G, f) : G C (1, ...,n},v e (0, 1}}. Intuitively, the elementary event (G,v) 
means that fault-free processes are exactly those in G (G will be called a good 
set) and that v is the value to be conveyed, i.e., v = b. For any G C {1, ...,n|, 
we denote G' = (1, ...,n| \ G. The probability distribution in this sample space 
is as follows: P(G, u) = q{v) ■ riieG(l “ P*) ' OieG' Pi 

This definition captures the assumption about stochastic independence of all 
failures and of the a priori Boolean value distribution. The probability function 
P is extended to all events (subsets of G) in the following standard way: P{S) = 
J2(G,v)&e ^)- 

A report R is compatible with an elementary event (G, w), if it can occur 
when the value to convey is v and the set of fault-free processes is G. This 
simply means that R{i) = v, for all i G G (recall that faulty processes may 
report arbitrary values). Let p{G,v) denote the set of reports compatible with 
(G, v). Hence p(G, v) = {R € TZ : Vi € G R{i) = uj. 

If D is a decision strategy, the event that D is correct, denoted by Cor(D), 
consists of elementary events (G, v), such that the decision strategy outputs v on 
any report compatible with (G, v), i.e., of those elementary events for which the 
strategy outputs the correct value regardless of the behaviour of faulty processes. 
Formally, Cor(D) = |(G, f) : VR G p{G,v) D{R) = u|. The reliability of the 
decision strategy D is defined as the probability of its correctness: Rel(D) = 
P(Cor(D)). A decision strategy is optimal if Rel(I?) > Rel(P'), for any decision 
strategy D' . 

An event £ C is an intersecting family if GinG 2 yf 0 for all (Gi, 0), (G 2 , 1) G 
S. An event £ C 17 is a filter if it is an intersecting family and if, for all (Gi ,v) & S 
and Gi C G 2 , we have (G 2 ,u) G £. For any intersecting family £ we define 
P{£) = {(H,u) G Q : 3(P,ti) G £ B C A}. Clearly, for any intersecting family 
£, the event P{£) is a filter. 

Lemma 1. For any decision strategy D, Cor(D) is an intersecting family. 

The following lemma shows that the probability of a filter is decreasing with 
respect to processor failure probabilities. 

Lemma 2. Let T be a filter and let 'P = (pi,...,p„), 'P' = (p},...,p(j) be two 
sets of failure probabilities of processes i = such that Pi < p) for all 
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i = Let P and P' be probability functions on fi corresponding to the 

same distribution {p, 1—p) of Boolean value b and to process failure probabilities 
V and V , respectively. Then P{P) > P'{T). 

Proof. It is enough to prove the lemma for pi^ < p[^ and pi = p[, for all i ^ io. 
Denote r = pi^ and r' = p'-^. Consider any C C {1, ...,n} \ {zq}- Let v € {0, 1}. 
Denote X = - Pi) ■ n*eC'\po}P*- P(C,v) = q(v)rX, P'{C,v) = 

q{v)r'X, P{C U {zq}, v) = q(v)(l — r)X, and P'(C U {zo}, v) = q(v)(l — r')X. 
It follows that P'(C, v) — P(C, v) = q{v)X{r' — r) and P'{C U {zq}, v) — P{C U 
{zoIjZz) = q{v)X{r — r'). Consequently, any gain of probability on an event 
(C, v) is compensated by an equal loss of probability on the event (C U {zo}, v). 
By definition of a filter, (C, u) G T implies (C U {zoIjZi) G P- This concludes 
the proof. (The possibility of sharp inequality P{fF) < P'{P) comes from the 
fact that there may exist sets C C {l,...,n} \ {zq} such that {C,v) ^ T but 
(CU{zo},z;) G.F.) □ 



3 Description and Analysis 

of an Optimal Decision Strategy 



In this section we describe a decision strategy which is then proved optimal. Call 
a process z heavy if pi < 1/2, and call it light otherwise. Denote by H the set of 
all heavy processes and denote by L the set of all light processes. The strategy is 
called Ignore-the-Suspects (IS, for short), as it completely ignores report values 
of light (suspect) processes. The definition depends on whether heavy processes 
exist. For any set G C {l,...,n}, define I{G) — OieGriH (1 -k) • n ieH\aP^- 
For example, /(0) = and /({I, ..., rz}) = I{H) = ~ pS The 

following lemma follows from the definition of heavy processes. 



Lemma 3. I{Gi) < /(G2) whenever G\ C G2. 

For any report R and any value v G {0,1}, denote by Z{R,v) the set of 
processes that “vote” for v in report R. More precisely, Z{R, v) = {i G {1, ..., rz} : 
i?(z) = u|. 

Case 1. The set PI of heavy processes is nonempty. 

For any report R, 



IS(i?) = 



0ifp-/(Z(i?,0))>(l-p)-/(Z(i?,l)) 
1 otherwise. 



( 1 ) 



In this case, the strategy compares two events, for u = 0, 1: that the value to 
be conveyed is v, all heavy processes voting for v are fault free and all heavy 
processes voting against v are faulty. Then it picks the value v yielding the more 
probable of these events, breaking ties arbitrarily (we choose to break ties in 
favour of 0 but this decision does not influence optimality). 

Case 2. The set PI of heavy processes is empty. 



IS(i?) = 0 . 



(2) 
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In this case, the strategy disregards the report and picks the a priori more 
probable value: recall that we assumed p> 1 — p. 

This concludes the description of strategy IS. Notice that, for any report R, 
in order to compute IS(i?), 0{\H\) arithmetic operations have to be performed 
in Case I and no operations in Case 2. Hence the value of the strategy on any 
report can be computed in time 0{\H\), when i? yf 0, and in constant time 
otherwise. This implies 

Theorem 1. The value of strategy IS on any report can he computed in time 
linear in the number of processes. 

The rest of this section is devoted to the proof of optimality of strategy IS. 
We first prove a lemma describing the set Cor(IS). 

Lemma 4. If the set of heavy processes is nonempty then 

Cor (IS) = {{G,v)en-. q{v) ■ I{G) > q{l - v) ■ I{G')} 

U {(G, 0) G C : p • /(G) = (1 - p) • I{G')} ; 

otherwise Cor(IS) = {(G, 0) : G C {1, ..., n}}. 

Proof. First suppose that the set H of heavy processes is nonempty. Let (G, v) G 
12 and consider four cases. 

Case 1. q{v) ■ /(G) > q{l - v) ■ I{G'). 

Let R G p{G,v). Hence, for all i G G, we have R{i) = v. Consequently, 
G C Z{R, v) and hence /(G) < I{Z{R, v)) and Z(G') > I{Z{R, 1 — v)), in view 
of Lemma 3. This implies q{v) ■ I{Z{R, v)) > < 7(1 — v) ■ I{Z{R, 1 — w)), and hence 
IS(i?) = V. Thus IS(i?) = V for all R G p{G,v). This implies (G, u) G Cor(IS). 

The arguments in the three other cases: 

Case 2. q{v) ■ I{G) = < 7(1 — v) ■ I{G') and u = 0. 

Case 3. q(v) ■ i\g) = < 7(1 — v) ■ i\g') and z; = 1. 

Case 4. q\v) ■ i\g) < g(l - v) ■ i\g'). 
are similar. 

This concludes the proof when H is nonempty. If the set H of heavy processes 
is empty, the lemma follows immediately from the definition of the strategy IS. 

□ 



Lemma 5. Rel(IS) does not depend on failure probabilities of light processes. 

Proof. If the set H of heavy processes is empty. Lemma 4 implies that Rel(IS)= 
p, which implies independence of Rel(IS) of any p^. If the set of heavy processes 
is nonempty. Lemma 4 implies the following equivalence: for any set S of heavy 
processes, any set T of light processes, and any v G {0,1}, we have {S,v) G 
Cor(IS) if and only if (S' U T, u) G Cor(IS). This implies that Rel(IS) does not 
depend on p^, for any light process i. □ 

Theorem 2. The decision strategy IS is optimal. 
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Proof. We will show that Rel(IS) > P{T), for any intersecting family X C 17. 
By Lemma 1 this implies Rel(IS) > Rel(I?), for any decision strategy D. First 
suppose that pi = 1/2 for all light processes i. Fix an intersecting family X. 
Consider two cases. 

Case 1. The set H of heavy processes is nonempty. 

Without loss of generality suppose that 1 G i7. Call a set 5 C dominating 
if either I{S) > I {H \ S) or I{S) = I{H \ S) and 1 G S'. For any dominating 
set S consider the family C(S) of all events (S U T, 0), (i7 \ S U T, 0), (S U T, 1), 
{H \ S (JT, 1), for all T C L. In order to prove Rel(IS) > P{X), it is enough to 
show that P(Cor(IS) C C(S)) > P(X n C(S)), for all dominating sets S. Fix a 
dominating set S and consider two cases. 

Subcase 1.1. p ■ I{H \S)> {1 — p) ■ I{H). 

By definition of a dominating set we have p ■ I{S) > {1 — p) ■ I{H \ S) and 
by the assumption of Subcase 1.1 we have p ■ I{P[ \ S) > {1 — p) ■ I{H). Hence, 
by Lemma 4 we have Cor(IS) C C(S) = {(S U T, 0), (iL \ S U T, 0) : T C L}. 

Now consider XnC(S). If (SUT, 1) G X, for some T C L, then {1X\SUL\T, 0) ^ 
X, by definition of an intersecting family. Similarly, if (iL \SUT, 1) G X, for some 
T C L, then (S U X \ X, 0) ^ X. However P{{H \SU L\T,0)) > P((S U T, 1)) 
and P((S U X \ T, 0)) > P{{H \ S U T, 1)). (Here we use the assumption that 
Pi = 1/2 for all light processes i.) Consequently, in X n C(S'), an event (G,v) 
from Cor(IS) C C{S) can only be replaced by an event of probability not larger 
than P{{G,v)). This implies P(Cor(IS) nC(S')) > P{X nC{S)) in Subcase 1.1. 

Subcase 1.2. p ■ I{H\S) < (1 — p) • I{Pl). 

Now we have p-I{S) > {l-p)-I{H\S) and p-I{Pl\S) < (1— p) -/(iL). Hence 
Cor(IS) n C(S') = {(S' U X, 0), (S U X, 1) : X C X}. Similarly as in Subcase 1.1, we 
conclude that an event {G,v) from Cor(IS) C C{S) can only be replaced by an 
event of probability not larger than P{{G,v)). This implies X(Cor(IS) nC(S)) > 
X(X n C(S)) in Subcase 1.2 as well. 

Case 2. The set H of heavy processes is empty. 

By our assumption, all processes have failure probability 1/2. Hence we have 
X(G, 0) = p/2” and X(G, 1) = (1 — p)/2”, for any G C {!,... ,n}. In Case 2, 
Cor(IS) = {(G,0) : G C |l,...,n}}. If (G, 1) G X, for some G C |l,...,n} then 
(G',0) ^ X, by definition of an intersecting family. Consequently, an event of 
probability p/2” from Cor(IS) can only be replaced in X by an event of smaller 
or equal probability (1 — p)/2”. This implies Rel(IS) > P(X) in Case 2. 

It remains to remove the assumption that Pi = 1/2 for all light processes i. 
Let X be any intersecting family. Let P denote, as before, the probability function 
yielded by process failure probabilities pi, where pi = 1/2 for all light processes i. 
Let P' denote the probability function yielded by process failure probabilities p', 
where p( = pi for all heavy processes i, and p( for light processes i are arbitrary 
reals from the interval [1/2, 1]. By Lemma 5 we have X(Cor(IS))= X'(Cor(IS)). 
Since X(X) is an intersecting family, the first part of this proof implies P{T{X)) < 
X(Cor(IS)). Since X(X) is also a filter. Lemma 2 implies P'{T{X)) < P{T{X)). 
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Finally, since X C T{X), we have P'{X) < P'{T(X)). Hence we get: 

P'{X) < P'{T{I)) < P{P{X)) < P(Cor(IS)) = P'(Cor(IS)) . (4) 

This concludes the proof for arbitrary values of process failure probabilities. □ 



4 Locally Optimal Decision Strategies 



An alternative way of optimizing the quality of a decision is adopting a local 
rather than a global point of view. Instead of maximizing the total probability 
of correctness of a decision strategy, as we did in the previous section, we may 
want to find the conditionally most probable value, given a report R. It is natural 
to call such a strategy locally optimal, as it locally optimizes the probability of 
a decision, given a report obtained from processes, disregarding the impact of 
such a choice on the overall probability of correctness. The phrase “conditionally 
most probable value, given a report i?” has to be formalized very carefully. Its 
meaning is not obvious, for the same reason as the meaning of probability of 
correctness of a strategy, examined in Sect. 3, was not straightforward. Since 
the behaviour of faulty processes is Byzantine rather than stochastic, the event 
corresponding to the occurrence of a given report R is not immediately clear. 
Nevertheless, we will define the notion of the probability of a value given any 
fixed report; a locally optimal decision strategy should pick, for any report R, the 
value for which this probability is larger. We will then define a locally optimal 
decision strategy which computes the decision value in time linear in the number 
of processes. It is natural to ask if these two ways of optimizing the decision (the 
one presented in Sect. 3 which could be called global optimization and the local 
optimization discussed in this section) are equivalent. The surprising answer is 
no. We show that, in general, local maximization of probability may lead to a 
decision strategy which does not have the highest probability of correctness. 

Fix any report R. Recall that Z{R, v) = {i G {1, ..., n} : R{i) = u}. Given the 
report R, the probability that value v was to be transmitted, is the probability of 
the following event: b = v and all fault-free processes reported v. More formally: 
P{v\R) = P{{{G,v) : G C Z{R,v)}). Hence we adopt the following definition. 
A decision strategy D : TZ — > {0, 1} is called locally optimal, if 



D{R) = 



0 if P{{{G, v):GC Z{R, 0)}) > P{{{G, v) : G C Z{R, 1)}) 

1 otherwise. 



( 5 ) 



Since P{{{G,v) : G C Z{R,v)}) = q{v) ■ rijG2(fl,i-«) (unique) locally 

optimal strategy Loc: TZ — > {0, 1} can be defined as follows: 

l^oc(R) = / ^ ■ riiez(fl.i) P* > (1 - P) • n*ez(fl,o) P» • /g) 

^ ^ (1 otherwise. ' ^ 

The following result follows from the definition of strategy Loc. 



Theorem 3. The decision strategy Loc is locally optimal. Its value on any report 
can be computed in time linear in the number of processes. 
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We now present an example witnessing to the (somewhat surprising) fact 
that optimality and local optimality of decision strategies are not equivalent 
notions. 

Example 1. Consider a system in which the probability distribution of the 
Boolean value to be conveyed is uniform, i.e., p = 1/2, and there are three 
processes, 1, 2, and 3, with failure probabilities p\ = 1/4 and P 2 = P 3 = 2/5, 
respectively. Consider the report R, such that i?(l) = 1 and R{2) = R{3) = 0. 
Since p = 1 — p and 

nz(if,l)) = j^|-|>|-|4 = /(Z(/i,0)), (7) 

the optimal strategy IS gives value 1 on report R. On the other hand, since 

P({(G, 0) : G C Z{R, 0)}) = i > ^ ^ = P({(G, 1) : G C Z{R, 1)}) , (8) 

the locally optimal strategy Loc gives value 0 on report R. This shows that an 
optimal decision strategy need not be locally optimal. 

The converse is not true either. For the above described system we have 

Cor(IS) = {({l,2,3},u),({l},u),({l,2},u),({l,3},u):r;G{0,l}} , (9) 

hence Rel(IS)=0.75. On the other hand, 

Cor(Loc) = {({l,2,3},u),({2,3},u),({l,2},r;),({l,3},u):r;G{0,l}} , (10) 

hence Rel(Loc)=0.72. This shows that the locally optimal decision strategy need 
not be optimal. 

We conclude the paper with the observation that the above anomaly does not 
occur in the special case when the probability distribution of the Boolean value 
to be conveyed is uniform and all processes have the same failure probability less 
than 1/2. 

Proposition 1. If p = 1/2 and all process failure probabilities are equal and 
smaller than 1/2, then the optimal strategy IS and the locally optimal strategy 
Loc are equal. 

Proof. Let p = 1/2 and let g < 1/2 be the failure probability of all processes. 
Fix any report R. Let 2 = \Z{R,Q)\ and hence n — z = |Z(i?, 1)|. We have 
I{Z{R,0)) = (1 — qYq^~^ and I{Z{R,1)) = (1 — qY~^q^. It follows that 
i\z\r,{))) > /(Z(i?, 1)) if and only if z > n — z. Consequently, IS(i?) = 0 
if and only if z > n — z. On the other hand, by definition of Loc, Loc(i?) = 0 if 
and only if q^~^ > q^ , which is also equivalent to z > n — z. This concludes the 
proof. □ 
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5 Conclusion 

We considered decision strategies based on reports of fault-prone processes, as- 
suming that fault probability distribution among processes is known, and that 
faulty processes behave in a Byzantine way. We constructed a deterministic de- 
cision strategy which has the highest possible probability of correctness. We also 
showed that a locally optimal decision strategy (that finds, for every set of re- 
sults conveyed by processes, the conditionally most probable value that could 
yield these results) need not have the highest probability of correctness. Both our 
strategies, the globally optimal and the locally optimal, compute the decision 
value for any report of the processes in time linear in the number of processes. 

Our strategies work for arbitrary fault probability distributions among pro- 
cesses, and for arbitrary a priori distribution of transmitted values, but we as- 
sume that these values are Boolean. It would be interesting to extend our results 
to the case of arbitrary finite sets of possible transmitted values, with arbitrary 
a priori probabilities. 
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Abstract. We investigate the problem of sharing the cost of a multi- 
cast transmission in a wireless network where each node (radio station) 
of the network corresponds to (a set of) user(s) potentially interested in 
receiving the transmission. As in the model considered by Feigenbaum 
et al [2001], users may act selfishly and report a false “level of interest” 
in receiving the transmission trying to be charged less by the system. 
We consider the issue of designing a so called truthful mechanisms for 
the problem of maximizing the net worth (i.e., the overall “happiness” 
of the users minus the cost of the transmission) for the case of wireless 
networks. Intuitively, truthful mechanism guarantee that no user has an 
incentive in reporting a false valuation of the transmission. Unlike the 
“wired” network case, here the cost of a set of connections implementing 
a multicast tree is not the sum of the single edge costs, thus introduc- 
ing a complicating factor in the problem. We provide both positive and 
negative results on the existence of optimal algorithms for the problem 
and their use to obtain VCG truthful mechanisms achieving the same 
performances. 



1 Introduction 

One of the main benefits of ad-hoc wireless networks relies in the possibility of 
communicating without any fixed infrastructure. Indeed, each station is a ra- 
dio transmitter/receiver and communication between two stations that are not 
within their respective transmission ranges can be achieved by multi-hop trans- 
missions: a set of intermediate stations forwards the message till its destination. 

In this work, we consider the problem of sharing the cost of a multicast 
transmission in such wireless networks. A set of radio stations implements a 
directed communication graph which can be used to broadcast a (set of) messages 
from a given source node s to any subset of users. In particular, each user j is 
sitting close to some station i and she can receive the transmission only if i 
does. In addition, user j benefits from receiving the transmission some amount 
specified by a value Vj (say, how much j valuates the transmission) . 

* Work supported by the European Project IST-2001-33135, Critical Resource Sharing 
for Cooperation in Complex Systems (CRESCCO). 
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Since transmissions along the edges (i.e., links) of the communication graph 
require some costs (i.e., the power consumption of the station forwarding the 
messages), one would like to select a suitable set of nodes, to which the trans- 
mission is sent to, so that (i) the users share the cost of the transmission, and 
(ii) the overall net worth is maximized: the net worth is defined as the sum of 
the VjS of all users receiving the transmission minus the overall cost due to the 
used links. 

Although the costs of the links are a property of the network (thus known to 
the “protocol”), the valuation Vj is clearly a property of user j. Thus, each Vj is 
a private piece of information (a part of the input) held by user j only. So, a user 
may act selfishly and report a different value bj trying to receive the transmission 
at a lower price. We thus need to design so called truthful mechanism for our 
problem, that is, a suitable combination of an algorithm A and payments to the 
users which guarantee that (i) no user j has an incentive in reporting bj yf vj, 
and (ii) the algorithm A, once provided with the correct VjS, returns an optimal 
solution. 

This problem has been previously considered in the context of “classic” wired 
networks in [15]. In this work we consider a different model, that is, the wireless 
network one. The main difference between the two models relies on the different 
cost functions, which turns out to be a key point for solving the problem above. 

Cost of Wireless Connections. Consider a directed weighted communication graph 
Q = {S, £, w) defined as follows: S is the set of stations, and Q contains a directed 
edge (i,j) G f if and only if the direct transmission from i to j is feasible; in this 
case the weight w(i,j) is the minimum power required for station i to directly 
transmit to station j. For instance, in the empty space w{i,j) = d(j,j)^, where 
d(z, j) is the Euclidean distance between i and j. 

Each station is a radio transmitter/receiver and a station i is able to directly 
transmit a message to station j if and only if the power Pi used by station 
i satisfies Pi > w{i,j). Stations use omnidirectional antennas, and a message 
sent by station i to j can be also received by every other station f for which 
w{i,j') < w{i,j). In order to reduce the power consumption, every station i 
can adjust its transmission power Pi, thus implementing the set of connections 
{(i, j)| w{i,j) < Pi}. Hence, given a set of connections C C £, its cost is defined 
as follows: 

Cost(C) = max w{i,j), (1) 

i£S 

that is, the overall power consumption required to implement all these connec- 
tions. 

Net Worth of a Multicast Transmission. We are interested in sets C C £ which 
guarantee that, given a distinguished source node s, the set C connects s to a 
suitable set D{C) C 5 of destination nodes. Consider a set U of users, each of 
them located close to some of the nodes in S. The source s can send some kind 
of transmission (say a movie or a sport event) to a user j only if j is close to 
some of the destination nodes D{C). In addition, every user j has a valuation 
Vj of the transmission representing how much she would benefit from receiving 
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it (i.e., how much she would pay for it). As in the model of [15], we consider 
the situation in which each user j is sitting close to one station, say i; the latter 
represents the router of the network at distance one hop from user j. So, user 
j can receive the transmission only if node i does. Observe that, we can always 
reduce the case of several users located close to the same node to the case of (at 
most) one user close to one node (consider each user as a node with no outgoing 
edges and one ingoing edge of cost 0). Given a solution T C £, its net worth is 
defined as 

NW(T) = Worth(T) - Cost(T), (2) 

where Worth(r) = 

The cost sharing problem asks for a T C £ that, for a given source s, maxi- 
mizes the net worth function above. 

Selfish Users and Economical Constraints. Associated to each node there is a 
selfish agent reporting some (not necessarily true) valuation bp, the true value Vi 
is privately known to agent i. Based on the reported values b = {bi, 62, ... , bn) 
a mechanism M = (A, P) constructs a multicast tree T using algorithm A (i.e. 
T = A(b)) and charges, to each agent i, an amount of money to pay for receiving 
the transmission equal to P^{b), with P = (P^, P^, . . . , P"). 

There is a number of natural constraints/goals that we would like a mecha- 
nism M = (A,P) to satisfy/meet: 

1. Truthfulness (or Strategyproofness)^. For every i, let b-i := ( 61 , 62 , ... , 
bi-i, 6 i+i, . . . , bn) and {bi, b-i) := b. The utility of agent i when she reports bi, 
and the other agents report b-i, is equal to 

,, , . _ j Vi- P'-{bi,b-f)\iT = A{bi,b-i) amdi & D{T), 

otherwise. 

We require that, for every i, for every b-i, and for every bi ^ Vi, it holds that 
Ui{vi,b-i) > Ui{bi,b-i). In other words, whatever strategy the other agents fol- 
low, agent i has no incentive to lie about her true valuation Vi. A mechanism 
satisfying this property is called truthful. 

2. Efficiency. The net worth NW(T) yielded by the computed solution T is 
maximum, that is, NW(T) = maxcc£{NW(C)}. 

3. No Positive Transfer (NPT). No user receives money from the mechanism, 
i.e., P*(-) > 0. 

4. Voluntary Participation (VP). We never charge a user an amount of 
money grater than her reported valuation, that is, ybi,yb-i bi > P’’(bi,b-i). 
In particular, a user has always the option to not paying for a transmission for 
which she is not interested. 

5. Consumer Sovereignty (CS). Every user is guaranteed to receive the 
transmission if she reports a high enough valuation. 

6. Budget Balance (BB). ’Y^iP^ib) = Cost(A(6)). 

7. Cost Optimality (CO). The set of connections T is optimal w.r.t. the set 
of receivers D{T), that is, Cost(T) = minf7c£,D(T)=D(C){Cost(C)}. 

^ In Sect. 2 we provide a more general definition of truthfulness which applies to a 
wide class of problems involving selfish agents. 
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Clearly, the Efficiency requirement implies the Cost Optimality. Unfortu- 
nately, in some cases it is impossible to achieve efficiency, so we will relax it to 
r-efficiency, that is, r ■ NW(T) > maxcc£{NW(C')}. In these cases, we will take 

into account CO and r-CO, i.e., Cost(T) < r ■ min cqe, {Cost(C)}. 

D{T) = D{C) 

1.1 Previous Work 

Power Consumption and Range Assignment Problems. The problem of comput- 
ing a broadcast tree of minimal cost for wireless networks has been investigated 
in [19,13,9,30]. In particular, in [19] the authors proved that the problem is 
NP-hard to approximate within logarithmic factors, while it remains NP-hard 
even when considering geometric 2-dimensional networks [9] . Several variants of 
this problem have been considered in [23, 12, 13,9,30,5,6, 1] (see also [10] for a 
survey). However, to our knowledge, no algorithmic solution for optimizing the 
net worth has been given so far. 

Recently, the design of truthful mechanisms for the range assignment problem 
in presence of “selfish transmitters” (i.e., selfish agents that want to minimize 
the energy their station has to use) has been investigated in [2] for the strongly 
connectivity problem, and in [3] for point-to-point transmissions, respectively. 
Mechanism Design and Cost-Sharing Mechanisms in Wired Networks. The the- 
ory of mechanism design dates back to the seminal papers by Vickrey [29] , Clarke 
[8] and Groves [18], and recently found a natural application to (algorithmic) 
questions related to the Internet [25] (see also [16] and [26]). VCG mechanisms 
guarantee the truthfulness under the hypothesis that the mechanism is able to 
compute the optimum and the optimization function is utilitarian (see Sect. 2 
for a formal definition of utilitarian problem) . 

This technique is employed in [15] (and in this work) where the authors 
consider the wired networks case. They indeed provide a distributed optimal 
algorithm for the case in which the communication graph is a directed tree. This 
yields a distributed mechanism‘s which, for this problem version, satisfies all 
requirements mentioned above (truthfulness, efficiency, etc.) except for budget 
balance. 

Noticeably, a classical result in game theory [17,28] implies that, for this 
model, budget balance and efficiency are mutually exclusive. Additionally, in 
[14] (see also Theorem 5 in [7]) it is shown that no a-efficiency and /3-efficiency 
can be guaranteed simultaneously, for any two a, P > 1. So, the choice is to either 
optimize the efficiency (as in [15]) or to meet budget balance (as in [20,21,7]). 
In the latter case, it is also possible to obtain so called group strategyproofness, a 
stronger notion of truthfulness which can also deal with coalitions. On the other 
hand, if we insist on efficiency, NPT, VP, and CS, then there is essentially only 
one such mechanism: the marginal-cost mechanism [24], which belongs to the 
VCG family. 

All such negative results apply also to our problem (i.e., wireless networks). 
Indeed, a simple observation is that every instance of the “wired” case can be 

s The mechanism is able to compute both the solution and the payments in distributed 
fashion. 
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reduced to the wireless one using the following trick: replace every edge 
with two edges (i,x{i,j)) and (x{i,j),j) with. w{i,x(i,j)) = 0 and w{x(i, j), j) = 
w{i,j). So, also for our problem we have to choose between either budget balance 
of efficiency. 

1.2 Our Results 

We consider the problem of designing mechanisms that satisfy truthfulness, ef- 
ficiency, NPT, VP, CS, and CO in the case of wireless networks. We first show 
that, even though the problem is not utilitarian, it is possible to adapt the 
VCG technique so as to obtain truthful mechanisms based on exact algorithms 
(Sect. 2). 

Unfortunately, the problem is NP-hard, thus preventing from a straightfor- 
ward use of the VCG result to obtain polynomial-time truthful mechanisms. 
Motivated by this negative result, we first consider the problem restricted to 
communication graphs that are trees (this is the analogous of the result for 
wired networks in [15]). We prove that, in this case, the optimal net worth can 
be computed via a polynomial time distributed algorithm (Sect. 3.1). The im- 
portance of this result^ is threefold: 

— It shows that the hardness of the problem is confined in the choice of a 
“good” multicast tree, and not in its use: if an “oracle” provides us with a 
tree containing an optimal multicast tree, then we can compute the optimum 
in polynomial-time. 

— It is used to obtain a truthful distributed polynomial-time mechanism satis- 
fying NPT, VP, CS, and efficiency when the given communication graph is 
a tree. In this case, both the solution and the payments can be computed in 
distributed fashion using 0 ( 1 ) messages per link. 

— It can be used to approximate the problem in some situations for which 
a good “universal” tree exists, i.e., a tree containing a set of connections 
of cost not much larger than the optimal solution and reaching the same 
set of nodes. This approach is similar to that of several wireless multicast 
protocols^ which construct a multicast tree by pruning a broadcast tree T 
(e.g., MIP, MLU and MLiMST in [13]). In all such cases, one can assume 
that the communication graph Q is the tree T. 

Moreover, we show that a shortest-path tree can be used as universal tree 
so to obtain a polynomial-time mechanism satisfying truthfulness, NPT, VP, 
CS, and 0(n)-C0, for the case of any communication graph Q. In addition, our 
mechanism guarantees |D(T*) [-efficiency, for all instances that admit an optimal 
solution T* satisfying \D{T*)\ < 7 some constant 7 < 1 (Sect. 3.2). 
We also prove that, in general, for any i? > 0, no polynomial-time algorithm 
can guarantee i?-efficiency, unless P = NP. Notice that, this result rules out the 
possibility of having polynomial-time mechanisms satisfying 0 (n)-efficiency. 

® Independently from this work, Bilo et al [4] also provide polynomial-time truthful 
mechanisms for trees in the case of wireless networks. 

In this case the set of destination nodes (termed multicast group) is given in input. 
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We then extend our positive result to a class of graphs denoted as trees with 
metric free edges (see Sect. 3.3). Our technical contribution here is a non-trivial 
algorithm extending the technique and the results for trees. 

Finally, we turn our attention to the Euclidean versions of the problem, that 
is, the case in which points are located on a d-dimensional Euclidean space and 
= d(i,j)“, for a fixed constant a > 1. We first show that the problem 
remains NP-hard even when d = 2 and for any a > 1 (Sect. 3.4). For the case 
d = 1 we provide a polynomial-time mechanism satisfying truthfulness, efficiency, 
NPT, VP, CS and CO, with the additional property of ensuring multicast trees 
of depth at most h, for any 1 < h < n—1 given in input. This result exploits the 
broadcasting algorithm in [11]. For the case d = 2, we present a solution based 
on the construction of so called Light Approximate Shortest-path Trees (LASTs) 
given in [22]. This achieves a better performance w.r.t. MST-based solutions in 
several cases. 

Due to lack of space some of the proofs are omitted. We refer the interested 
reader to the full version of this work [27] . 

2 Optimal Algorithms Yield Truthful Mechanisms 

For the sake of completeness, we first recall the classical technique to obtain 
truthful mechanisms for utilitarian problems known as VCG-mechanism [29,8, 
18]. We then show how to adapt this technique to our (non-utilitarian) problem. 

Let us first consider a more general situation in which each agent i has 
a certain type ti. The valuation of agent i of a solution X is represented by a 
function VALj(A, ti), where the function VALi(-, •) is known to the mechanism. A 
maximization problem is utilitarian if its objective function (?(•), which depends 
on the agents type vector t = {ti,t 2 , ■ ■ ■ ,tn), satisfies 



for any solution X . Each agent i can declare a different type bi to the mechanism. 
We have the following result on the VCG mechanism M = {ALG, Pvcg)- 

Theorem 1. [18] If ALG is an optimal algorithm for a utilitarian problem U, 
then the mechanism M = {ALG, Pvcg) is truthful for U. 

Let o'i{T) = 1 if i € D{T), and 0 otherwise. Notice that, simply setting 
VALi(r, Ui) := (Ji{T) ■ Vi does not satisfy the definition of utilitarian problem 
since NW(T) yf J2’i=i'^^^i{T,Vi) = Worth(T). Nevertheless, the next results 
states that the VGG technique can be adapted to our problem. The main idea is 
to initially charge each node by the cost of its ingoing edge in the tree (computed 
as in the wireless network case) so to “reduce” our problem to a utilitarian one 
(see [27] for the details). 

Theorem 2. Let A be a (polynomial-time) exact algorithm for maximizing the 
NW(-) function. Then the cost sharing problem on wireless networks admits a 
(polynomial-time) mechanism M = {A, Pa) satisfying truthfulness, efficiency, 
NPT, VP, CS and CO. 



n 




(3) 
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3 Special Communication Graphs 

Motivated by the result of the previous section, we focus on the existence of 
polynomial-time exact algorithms for the cost sharing problem on wireless net- 
works. Since the problem is NP-hard, also for Euclidean 2-dimensional instances 
(Sect. 3.4), in the following we will focus on restrictions for which such algorithms 
exist. 

3.1 Trees 

We proceed similarly to [15] and assume that the communication graph is a 
directed tree T = (5,£). 

Definition 1. For every i € T, let p(i) denote its parent in T. Also let Ci = 
w{p{i),i) and Cg = 0. We denote by % the subtree rooted at i, and by ch(z) the 
set ofi’s children. Finally, NWopt(*) denotes the optimal net worth of 7), that 
is, the optimum for the instance in which i is the source node and the universal 
tree is %. 



Algorithm Wireless_Trees at node i 

1. After receiving a message from each child j G ch(i) do 

(a) Add{j) := -Cj + Efc6ch(i),c,<c, a"; 

(b) T{i) := 0; 

(c) if maxjgch(i) Add{j) < 0 then p' := Vi 

(d) else do 

i. Add := maxjg ch(i) Add{j)-, 

ii. p’’ := Vi -h Add] 

iii. j* := arg max{j G ch{i)\Add{j) = Add}] 

iv. T{i) := {{i,j)\w{i,j) < w{i,j*)}] 

2. send p'’ to parent p(i); 



Fig. 1. The distributed algorithm for trees computing an optimal solution in bottom-up 
fashion. 



Lemma 1. For every node i, it holds that 



NWopt(i) = Ui -I- maxjO, max {-Cj -I- NWopt(A:)}}. (4) 

k^cn{i) ,Ck<Cj 



Proof. The proof is by induction on the hight h oi%. Obviously, for h = 1, since 
f is a leaf node, then NWopt(i) = Vi. Let us now assume that the lemma holds for 
any h' < h — 1, and let us prove it for h. Let T* denote an optimal solution for 
7). Let us first observe that, if NWopt(f) = NW(T*) > Vi, then T* must contain 
at least one outgoing edge from node i. Let [i,j) be the longest such edge in 
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T* . For any node fc G 7^, let T*^ denote the subtree of T* rooted at k. Since 
(i,j) G T*, then k G D{T*), for all k G ch(i) such that Ck < Cj. So, 

Worth (T^ = t;, + ^ Worth(i;%), (5) 

A;Gch(z),Cfc<Cj 

Cost(i;*) = c,- + ^ Cost{T*,). (6) 

/cGch(z),Cfc<Cj 

Let us now suppose, by contradiction, that there exists a I G ch(i), with c/ < cj 
and NW(r*j) < NWopt(Z)- Let T( denote the subtree of % yielding optimal net 
worth w.r.t. %, that is, NW(T/) = NWopt(0- Let T' be the solution obtained 
by replacing, in T*, T*^ with T[. Since T/ still contains all edges (i,k), with 
w{i,k) < w{i,j), we have that 

Worth(T/) > Worth(T/) + v,+ ^ Worth(i;%). (7) 

kGch{i) ,Ck<Cj ,k^l 

Clearly, Cost(T/) = Cost(T*) — Cost(T*j) + Cost(T/). This, combined with Eq. 7, 
yields NW(T;0 > NW(T*)-NW(T*j + NW(T/). From the hypothesis NW(i;*,) < 
NWopt(0 = NW(r/), we obtain N\A/(T/) > NW(T*), thus contradicting the op- 
timality of T*. So, for every k G ch(*) with Ck < Cj, it must hold NW(T*^) = 
NWopt(fc). From Eq.s 6-5 we obtain 

NWopt(z) = NW(T*) =v.-c,+ Y. NW(T*,) 

k^ch{i) ,Ck^Cj 

= v^-Cj+ Y NWopt(fc). (8) 

fcGch{z) ,Cfe <Cj 

The optimality of T* implies that, if NWopt(j) > Vi, then j must be the node in 
ch(i) maximizing the right quantity in Eq. 8. The lemma thus follows from the 
fact that NWopt(i) > fG taking no edges in % yield a net worth equal to Vi. This 
completes the proof. 

Theorem 3. For any communication graph T which is a tree, algorithm Wire- 
less_Trees computes the optimal net worth in polynomial time, using 0{n) total 
messages and sending 0(1) -messages per link. 

In Fig. 2 we show a distributed top-down algorithm for computing of 

Theorem 2 (see the proof in [27]). The code refers to a non-source node; the 
computation is initialized by s which, at the end of the bottom-up phase of 
Algorithm Wireless_Trees, has computed the value NWopt = and it executes 
the instructions la-le for every j G ch(s). 

Using the algorithm in Fig. 2, from Theorem 3 and from Theorem 2 we obtain 
the following: 

Corollary 1. If the communication graph is a tree, then the cost sharing prob- 
lem on wireless networks admits a distributed polynomial-time truthful mecha- 
nism M = {A, Pa) satisfying efficiency, NPT, VP, CS and CO. 
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Algorithm Wireless_Trees_Pay at node i = p{j) 

1. After receiving the message (NWopt,A“) from parent p(i), for each child j G ch(i) 
do 

(a) A^'=° := NWopt - hj) 

(b) X := maxfcgph(i),fc7ij{— Cfc + X)i6ch(i),ci<cfc A }i 

(c) A“^ := NWopt — /r* + Ui + max {0, x}] 

(d) A^' := A"-’ - A^'=°; 

(e) send (NWopt, A^) to child j; 



Fig. 2. The distributed algorithm for computing the payments in top-down fashion. 



3.2 Do Good Universal Muticast Trees Exist? 

In this section we propose an application of our optimal algorithm given in 
Sect. 3.1. In particular, we consider mechanisms that pre-compute some broad- 
cast tree T (i.e., DiT) = S) and then solve the problem by computing an optimal 
subtree T of T. Let ALG„„ denote the resulting algorithm. The following result 
is a simple generalization of Corollary 1. 

Theorem 4. There exists a payment function Pun such that = (ALG„„, 
Pun) satisfies truthfulness, NPT, VP and CS. Moreover, f/ALG„„ runs in poly- 
nomial time, then the payment functions Pun computable in polynomial time 
as well. 



The next result provides an upper bound on the approximability of the cost 
sharing problem when the input communication graph is not a tree. The idea is 
to use a shortest-path tree as universal tree. 



Theorem 5. There exists a polynomial-time mechanism Mun = (ALG„n,P„„) 
satisfying truthfulness, NPT, VP, CS and 0{l)-C0, where I = |D(T)| and T 
is the computed solution. Additionally, for any 7 < 1 and for all instances that 
admit an optimal solution T* satisfying \D{T*)\ < , Mun guarantees 

also {j^)- efficiency, with k = \D{T*)\. Thus, in this case, Mun guarantees 
0{n)-efficiency. 



The above result guarantees 0(|U(r*)|)-efliciency only in some cases. The 
next theorem rules out the possibility of obtaining polynomial-time 0{\D{T*)\)~ 
efficiency in general. Its proof is a simple adaptation of an analogous result for 
the wired case in [15]. 



Theorem 6. For any R > 0, no polynomial-time R- approximation algorithm 
(mechanism) exists, unless P = NP. 



3.3 Trees with Metric Free Edges 

We consider the case in which the set of edges of the communication graph 
G = {S,£) is partitioned into two sets T U P. Similarly to the case considered 
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in Sect. 3.1, T induces a spanning tree of Q, i.e., D{T) = S, and, for every node 

i, we must select a set of outgoing edges in the set T. However, each of these 
edge also induces a set of “additional connections for free” in the set J-, that is, 
for every (i, k) € IF, w{i, k) = 0. So, adding all such connections to the solution 
does not increase its cost. These connections are specified as follows. Let w*{i,j) 
denote the weight of the path in T connecting the node i to one of its descendants 

j. Let us define FREE(i,_;/) := {(i, A:)| w*(i, k) < w(i,j) A (i,j) G T A (i, k) ^ T}. 
Moreover, for every T let J^{T) := U(i j)gt FREE(i, j), and := 1F(T). We 
consider the restriction of the problem in which every feasible solution C C TU 
must fulfill the following property: C contains an edge (i,k) G J- ii and only if 
it also contains an edge (z,j) G T with w*{i,k) < w{i,j). In other words, no 
edge in can appear as the longest outgoing edge of a node in any solution 
C, thus implying Cost(C) = Cost(C n T). Observe that this definition captures 
some restrictions of the 2-dimensional Euclidean case (see [27] for a discussion). 

The following result allows us to apply Theorem 2 and obtain a truthful 
mechanism. 

Theorem 7. The optimal net worth, in the ease of metric free edge, of any 
given tree T can he computed in polynomial time. 

Corollary 2. The cost sharing problem on wireless networks, in the case of 
trees with metric free edges, admits a polynomial-time mechanism M = (A, Pa) 
satisfying truthfulness, efficiency, NPT, VP, CS, CO. 

3.4 Geometric Euclidean Graphs 

In this section we consider so called geometric communication graphs, that is, 
stations are located on the d-dimensional Euclidean space, the communication 
graph is a complete graph with w(i,j) := d(i, j)“, where a > 1 is a fixed constant 
and d : PA 77+ is the Euclidean distance. 

Theorem 8. The problem of maximizing NW(-) is HP-hard, even when re- 
stricted to geometric wireless networks, for any d > 2 and any a > I . 

Definition 2. Let MST(5) denote the minimum spanning tree of a set of points 
S C 77^. Given a source node s G S, MSTbrd(5, s) denotes the directed spanning 
tree obtained by considering all edges o/MST(5) downward directed from s. Let 
optbrd(5, s) denote the minimum cost among allT <G S x S such that D(T) = S. 

The following result concerns the problem of constructing a tree T minimizing 
the cost for transmitting to all nodes in S: 

Theorem 9. [9, 30] For any d > I and for every a > d, there exists a constant 
c[( such that, for any S G 77^, and for every s G S, Cost(MST(5, s)) < c[( • 
optbrd(5, s). Ln particular, = 12 [30]. 

Unfortunately, the same approximability result does not hold if the set of 
destinations is required to be a subset of S (e.g., stations that form a grid of size 
^/n X y/n and only one node with a strictly positive valuation) . 
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A slightly better result can be obtained by using so called Light Approximate 
Shortest-path Trees (LASTs) introduced in [22]: 

Theorem 10. For any k, for any /3 > 1, for any d > 2, and for any a > d, 
there exists a polynomial-time mechanism M satisfying truthfulness, NPT, VP, 
and CS. Additionally, M satisfies 0{1)-C0 whenever the computed solution T' 
satisfies D{T') = S or \D{T')\ < k. 

We now consider the problem restricted to linear networks, i.e., the Euclidean 
case with d= 1. By using the result in [11] it is possible to prove the following: 

Theorem 11. The optimal h net worthf of any given communication graph Q 
corresponding to a linear network can he computed in polynomial time. 

Corollary 3. The cost sharing problem on linear wireless networks admits a po- 
lynomial time mechanism M = (A, Pa) satisfying truthfulness, efficiency, NPT, 
VP, CS, and CO. This holds also with the additional constraint of computing 
multicast trees of depth at most h. 

Very recently and independently form this work, Bilo et al [4] also consider 
the Euclidean case and provide 0(1)-BB and 0(1)-C0 mechanisms. 
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Abstract. A:-Interval Routing Scheme (A:-IRS) is a compact routing 
method that allows up to k interval labels to be assigned to an arc. A 
fundamental problem is to characterize the networks that admit fc-IRS. 
Many of the problems related to single-shortest-path fc-IRS have already 
been shown to be NP-complete. For all-shortest-path fc-IRS, the char- 
acterization problem remains open for fc ^ 1. We investigate the time 
complexity of devising minimal-space all-shortest-path fc-IRS and prove 
that it is NP-complete to decide whether a graph admits an all-shortest- 
path fc-IRS, for every integer k ^ 3, as well as whether a graph admits 
an all-shortest-path fc-strict IRS, for every integer fc ^ 4. These are the 
first NP-completeness results for all-shortest-path fc-IRS where fc is a 
constant and the graph is unweighted. Moreover, the NP-completeness 
holds also for the linear case. 

Keywords: interval routing; compact routing, NP-completeness. 



1 Introduction 

Interval Routing is a space-efficient routing method for communication networks 
[9] . The routing table stored at each node is made compact by grouping the set of 
destination addresses that use the same output port into intervals of consecutive 
addresses. Formally, the network is modeled as a finite graph G = {V, E), where 
the set of vertices, V, represents the nodes of the network and the set of edges, 
E, represents the bidirectional links. Each edge between the nodes u and v is 
viewed as two opposite arcs, (u, v) and (u, u). A routing scheme for network G is 
to assign each arc {u, v) a subset I{u, v) C V, such that the union of the subsets 
assigned to the arcs emanating from u covers the set V — {u}. Routing is then 
performed according to the assignment I, such that at vertex u, a message will 
be sent on the arc {u,v) whose I{u,v) contains the destination of the message. 
A good interval routing scheme would try to minimize the number of intervals in 
I{u, v) over all the arcs by selecting an address mapping L : V ^{1,2,..., |U|} 
and an assignment I : E ^ 2^ such that the maximum number of intervals over 
all arcs is minimized. If each I(u,v) contains no more than k intervals under L, 
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the routing scheme, denoted hy R = (L, /), is called a k-Interval Routing Scheme 
(/t-IRS). 

The standard definition of IRS assumes a single routing path between any 
two nodes. It therefore forces the arcs emanating from a node u to be assigned 
disjoint subsets, i.e., I{u,v) 0 I{u,w) = 4> for v ^ w. The routing process with 
such an IRS is deterministic. A more flexible routing scheme, called multipath 
IRS or non- deterministic IRS [10,7], allows multiple arcs of a node to point 
to a destination; the routing process can pick an arc randomly or according to 
dynamic trafic conditions. 

There are two different models of networks. The weighted model associates 
each edge of the graph with a positive number to denote the cost of commu- 
nication along that edge; the unweighted model assumes that the cost of every 
edge is one unit. The length of a path under both models is the sum of the 
costs of the edges in the path. In practice, routing along shortest paths is desir- 
able. A shortest-path IRS always induces shortest paths. A single- shortest-path 
IRS offers a unique shortest path between any two vertices in the graph. An 
all- shortest-path IRS is a multipath IRS that gives exactly all the shortest paths 
between any pair of vertices in the graph. 

1 and \V\ being considered consecutive, the interval [a, b] with a > b denotes 
the set {i\a ^ i ^ |R|} U {fjl ^ i ^ b}. An interval [a, b] is linear if a < 6, and 
circular otherwise. An IRS using only linear intervals is a linear IRS, LIRS in 
short. An IRS is striet, denoted SIRS, if every arc (u,v) satisfies u ^ I{u,v). An 
IRS is denoted SLIRS if it is both strict and linear. 

The space efficiency of an IRS is measured by its compactness, which is de- 
fined to be the maximum, over all the arcs (u,u), of the number of intervals in 
I{u,v). The characterization of the networks that admit shortest-path interval 
routing scheme with compactness k (/c-IRS for short) is a fundamental question 
in this field (note the stress on shortest path because if the shortest path re- 
quirement is relaxed, every graph supports a single path 1-IRS). The question 
has been a subject of extensive study for many variants of the basic interval 
routing method, and under different models. Succesful work has been done for 
many special classes of graphs, including trees, outerplannar graphs, hypercubes 
and meshes, r-partite graphs, interval graphs, unit-circular graphs, tori, 2-trees, 
chordal rings, and general graphs. A summary of these and other results on in- 
terval routing can be found in [7]. For general graphs, existing complexity results 
for various models and variants are summarized in Table 1, where an entry for 
a variant of single-path IRS refers to both the strict and non-strict versions of 
the problem; an entry for all-shortest-path IRS refers to both the linear and 
non-linear versions of the problem; NPC denotes an NP-complete problem. 

All the problems related to IRS with a single shortest path are known to 
be NP-complete. The completeness for every constant k ^ 3 in the table needs 
an explanation. It follows from combining the result of [3] with that of [8], or 
with [11]. In [3], Flammini gave a polynomial time construction of graphs from 
biniary matrices such that there are at most k blocks of consecutive I’s in each 
column of the matrix under some row permutation if and only if there is a single- 
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Table 1. Complexity results on characterizations of IRS 



Shortest paths 
represented 


Compactness 

k 


Variants 


Graph model 


Unweighted 


Weighted 


Single 


fixed k — 1 


IRS 


NPC, [2] 


NPC, [2] 


LIRS 


NPC, [2] 


NPC, [2] 


fixed k — 2 


IRS 


NPC, [3] 


NPC, [3] 


LIRS 


NPC, [3] 


NPC, [3] 


fixed fc ^ 3 


IRS 


NPC, [3, 8, 11] 


NPC, [3,8, 11] 


LIRS 


NPC, [3,8,11] 


NPC, [3,8, 11] 


general k 


IRS 


NPC, [2] 


NPC, [4] 


LIRS 


NPC, [2] 


NPC, [2] 


All 


fixed k — 1 


IRS 


7 


7 


SIRS 


P. [1, 5] 


P. [1,5] 


fixed k — 2 


IRS 


7 


7 


SIRS 


7 


7 


fixed k — 3 


IRS 


NPC, this paper 


NPC, this paper 


SIRS 


7 


7 


fixed fc ^ 4 


IRS 


NPC, this paper 


NPC, this paper 


SIRS 


NPC, this paper 


NPC, this paper 


general k 


IRS 


NPC, this paper 


NPC, [4] 


SIRS 


NPC, this paper 


NPC, [4] 



shortest-path (A:+1)-IRS for the constructed graph; in [8], Goldberg et al. proved 
that for every constant k ^ 2, deciding whether a given binary matrix can be 
row permuted leading to each column has at most k blocks of consecutive I’s 
is NP-complete; in [11], we strengthened the result by showing that the same 
NP-completeness holds even if the problem is restricted to symmetric matrices. 

For the all-shortest-path IRS, only partial answers (both positive and nega- 
tive) have been given. 1-SIRS can be reduced to the consecutive ones property 
of binary matrices which can be solved in linear time [1]. Flammini et al. in [5] 
presented characterizations for 1-SLIRS and 1-LIRS. On the negative side, in [4], 
it was shown that the optimization problem of determining the minimal k such 
that a given weighted network belongs to the class of all-shortest-path k-IRS 
is NP-hard. For unweighted networks, their characterization remains open for 
all-shortest-path IRS of compactness k ^ 1 [7]. 

In this paper, we study the characterization question of all-shortest-path 
IRS under the most basic model - unweighted graph model. Specifically, we 
prove that the characterization of networks which admit all-shortest-path /c-IRS 
(linear or non-linear) for every constant k ^ 3 is NP-complete. These results 
can be easily generalized to the weighted network model, and hence significantly 
extend the related NP-completeness results of [4]. 

The rest of the paper is organized as follows. The next section gives some 
formal definitions of the IRS models and their variants and the characterization 
problems. In Section 3 we prove the NP-completeness results. In the last section, 
we give some conclusive remarks about the implications of our results. 
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2 Preliminaries 

The graphs we consider are connected, loopless, and do not contain multi-edges. 
The length of the path in the graph is the sum of the costs of the edges in the 
path (for the unweighted model this is equal to the number of the edges in the 
path). For an arc e = (u,u), S{u,v) denotes the set of those vertices which can 
be reached from vertex u through a shortest path passing through e; with this 
notation, S{u,v) ^ S{v,u). We use Adj{v) to denote the set of neighbours of 
vertex v in the graph. 

The basic idea of interval routing is to label each vertex of the graph G = 
{V,E) by a unique number from the set {1,2,...,|R|}, and then at each vertex 
V, to label each incident edge by a set of intervals over the set {1,2,..., |R|}, 
in such a way that the union of the intervals assigned to the edges emanating 
from V covers the set {1,2,...,|R|}. The routing is performed according to the 
labeling; in any vertex, a message will be sent on (one of) the edges whose label 
contains an interval including the destination of the message. Formally, we define 
an Interval Routing Scheme as follows. 

Definition 1. Let G = (V, E) be a graph. An Interval routing Scheme (IRS) on 
G is a pair (L, I) defined by: 

(1) I is a one to one vertex labeling, I : V {1, 2, . . . , |R|}, that labels the 
vertices of V; 

(2) I is an arc labeling, I : E ^ 2^ , assigning a subset ofV to each edge of E, 

such that at every vertex u € V, U{'“} = 

(3) for every x,y € V: 

(3.1) there exists a sequence of vertices x = mq, u\, . . . ,Us = y such that for 
1 ^ i < s, y € I{ui-i,Ui) — {ui-i}; this sequence is called a routing 
path induced by {I, I); 

(3.2) any routing path induced by {1,1) between x and y is a simple path of 
G, i.e., uo,ui, ... ,Us are mutually different vertices of V. 

To save space in its routing table, an IRS expresses I{u,v), the subset of V 
assigned to an arc e = (it, v), with intervals over |1,2,...,|R|}. 

Definition 2. An interval of {1,2, ... , |R|} is one of the following: 

(1) A linear interval [i, j] = {i,i + 1, . . . ,j}, where i,j G (1, 2, . . . , |R|} and 

i < j; 

(2) a circular interval [i,j] = {i, . . . , \V\, 1 , . . . where i,j G (1, . . . , |R|} and 
i > j; or 

(3) the null interval [ ] which is the empty set (j). 

For simplicity, we will not always strictly distinguish between a vertex v and 
its label L{v), and will say that a vertex u G R is contained in an interval [i,j] 
if L{v) G [i,j]. 

Definition 3. Given U C. V and a labeling I of V , we denote by N{L, U) the 
minimum number of disjoint intervals such that their union is equal to {L{v)\v G 
U}. 
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For example, if L{vi) = i then -/V(L, {us, vq, U 7 , ug}) = 2, because {5, 6, 7, 9} = 
[5,7] U [9,9]. {ui, U 2 , U 5 , ue, 1 ^ 7 , ug}), however, depends on whether the cir- 
cular interval is allowed; it is 2 if yes, 3 otherwise. Apart from this use of the 
circular interval in expressing I(u,v) as intervals, I(u,v) itself may or may not 
be allowed to include the starting vertex u. These restrictions give rise to the 
following basic variants of IRS. 

Definition 4. Let R = {L,I) be an IRS on a graph G = (V,E); we eall R an 

(1) Strict Interval Routing Scheme {SIRS) if for every arc {u,v) & E, u ^ 
I{u,v); 

(2) Linear Interval Routing Scheme {II RS) if for every arc (u,u) € E the 
intervals representing I{u,v) are restricted to be linear; 

(3) Strict Linear Interval Routing Scheme {SLIRS) if it is both an SIRS and 
an LIRS. 

When u G I{u,v), there is an interval on edge (u, v) that contains the starting 
vertex u. An interval on an arc leaving a vertex containing that vertex is called 
an inner case interval; otherwise, it is strict. 

Definition 5. Let R = (1,1) be an IRS {SIRS, LIRS, SLIRS, respectively) on 
a graph G = {V,E). The compactness of R is the minimum integer k such that 
for every arc {u,v) G E, N{L,I{u,v)) ^ k. We denote by k-IRS {k-SIRS, k- 
LIRS, k-SLIRS, respectively) every IRS {SIRS, LIRS, SLIRS, respectively) of 
compactness no more than k. 

In practice, we are interested in designing an IRS that induces only the 
shortest paths. 

Definition 6 . Let R = {!,!) be an IRS (respectively SIRS, LIRS, SLIRS) on 
a graph G = {V, E); we call R 

(1) a single-shortest path IRS (respectively SIRS, LIRS, SLIRS) if it induces 
one and only one of the shortest paths between every pair x,y G V ; or 

(2) an all-shortest-path IRS (respectively SIRS, LIRS, SLIRS) if it induces ex- 
actly the set of all possible shortest paths between every pair x,y gV . 

By the definitions, for all-shortest-path IRS, the arc labeling / does not 
have much flexibility in assigning subsets to arcs. I{u,v) is either S{u,v) or 
S{u,v) U {u}. For all-shortest-path SIRS, the arc labeling / has no freedom but 
being identical with S, i.e., I{u,v) = S{u,v) for every arc {u,v). 

Given a graph G and an integer k, the problem of determining whether G 
supports an all-shortest-path /c-IRS can be formally defined as follows. 

The all-shortest-path k-IRS problem: 

Instance: A graph G, and a positive integer k. 

Question: Is there an all-shortest-path fc-IRS for G? 

The all-shortest-path /c-SIRS problem: 

Instance: A graph G, and a positive integer k. 

Question: Is there an all-shortest-path fc-SIRS for G? 



272 



Rui Wang, Francis C.M. Lau, and Yan Yan Liu 



The problems for all-shortest-path fc-LIRS and fc-SLIRS are defined similarly. 
Clearly, all of these problems are in the class of NP. In fact, given a graph G 
(with or without a cost function), an integer fc, a vertex labeling L, and an arc 
labeling I, it can be verified in polynomial time whether {L, I) is an all-shortest- 
path fc-IRS (respectively fc-LIRS, fc-SIRS, fc-SLIRS) for G. 

In this paper, we prove the above problems to be NP-complete; in particular, 
the all-shortest-path fc-IRS and fc-LIRS are NP-complete for every constant fc ^ 
3, and fc-SIRS and fc-SLIRS are NP-complete for every constant k ^ A. The proof 
is based on a polynomial transformation to these problems from the following 
NP-complete problem. 

Definition 7. The Consecutive Block for Symmetric Matrices (k- CBS for short): 
Instance: An n x n symmetric binary matrix M , and an integer fc > 0. 

Question: Is there a permutation of the rows of M such that for each column j 
the number of blocks of consecutive I's {i.e., the number of entries such 
that M[i,j] = 1 and either M[i -|- l,j] = 0 or i = n) is at most k? 

In [II], we proved that fc-CBS is complete for every fixed fc ^ 2. 

In the next section, we show a construction of graphs from symmetric ma- 
trices satisfying that the matrices can be row permuted such that each column 
has no more than fc blocks of consecutive I’s if and only if the constructed 
graph supports an all-shortest-path (fc -|- 1)-IRS ((fc -I- 2)-SIRS). In the discus- 
sion section, we will point out that the tranformation can start from a similar 
NP-complete problem in [4] to prove the NP-completeness of all-shortest-path 
fc-IRS for general integer fc, but not constant fc. 

3 NP-Completeness Results 

Starting with any instance of fc-CBS, {Mnxn, fc), where M is a symmetric binary 
matrix, we will construct a graph G = (V, E) such that there is a row permuta- 
tion on M leading to each column having no more than fc consecutive I’s blocks 
if and only if G supports an all-shortest-path (fc -|- 1)-IRS. The construction is 

simple. For each row i of M, create a set Ri = ri_ 2 , ■ ■ ■ , fi^k+i} of fc -I- 4 

vertices in G, which we call row vertices; for each column j of M, create a set 
~ ■ ■ ■ , Cj, 2 n(fe-i- 4 )-i-i} of 2n(fc -I- 4) -|- 1 vertices in G, called column 

vertices; these two types of vertices induce a bipartite subgraph of G, and Ri — Cj 
edges exist if and only if M[i,j] = 1- Finally, add a new vertex a to G to link all 
the other vertices so that the diameter of G is at most 2. Formally, G = (V, E) 
is obtained as follows (refer to Figure 1, where for simplicity, a line between Ri 
and Cj represents an edge set, {{rij,Cj^h)\rij £ Ri, cj^h £ Cj})- 

V = R + C + {a}, where 

R= [J .Ri and Ri = {rij\l < Z < fc -I- 4}; and 

C= (J Cj and Cj = {cj^h\^ ^ h ^ 2n{k + A) + 1}-, and 

l^j^n 
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Fig. 1. The transformation graph (s = A: + 4 and t = 2n{k + 4) + 1) 



E — { 5 |rj ; G Ri^ ^j,h € ^ j t 

U{{a,n^i)\ri^i € R}U {{a, Cj^h)\cj,h G C}. 

Note that any two vertices ri^i and ;/ of Ri have identical neighbour sets 
Adj{ri i) = Adj{ri ii), and so do any two vertices Cj^h and Cj^h' of Cj. By R{Ri) we 
denote the set of those column vertices that are neighbours of rij G Ri- Clearly, 
r(R^) = Adj{n^i) - {a} = {cj^h\{ri^i,Cj^h) G E} = Um[ij]=iQ- Similarly, we 
use r{Cj) to refer to the set of those row vertices that are linked with cj^h G Cj. 
Then, R{Cj) = UM[ij]=i 

As the diameter of G is 2, for each arc e = (u, u), S{e), the optimally reachable 
vertices from u via e, is Adj{v) — Adj{u) + {u} — {u}. We summarize in the next 
proposition the S{e) for various types of arcs e in G. 

Proposition 1. In the transformation graph G = (V,E), for 1 ^ j ^ n, 1 ^ 
I Gi k \ Gi h Gi ‘2.ji{k -t“ 4) -t“ 1, 

S{a, r^j) = {n_z}; 

*S(n, 

S{rii^ a) = {a} + R — {rij} + C — E{Ri); 

.hf} — “t" T(Cj') 

S{cj^h: fl) = {a} + C — {cj^h} + R — r{Cj); 

, 7*2 / ) — E(^Ri) 

In the above, and in the following, we use + and — respectively for the union 
and the difference operation on sets. 
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Lemma 1. There exists a permutation of the rows of the symmetric matrix 
Mnxn that would result in a matrix having at most k blocks of consecutive l^s 
per column if and only if the graph G obtained by the above transformation from 
(Mnxnjk) supports an all-shortest-path (k+l)-IRS. 

Proof. Suppose first that there exists a permutation tt of the rows of Mnxn that 
leads to at most k blocks per column. Without loss of generality, assume irfi) = i; 
then the (/c + l)-IRS can be constructed as follows. The vertex labeling L is such 
that L{vig) = {i — l)(fc + 4) + 1 for each row vertex riy G R; L{a) = n(fc + 4) + 1; 
and L{cj^h) = n{k + 4) + 1 + (j — l){2n{k + 4) + 1) + /i for each column vertex 
Cj^h G C. That is, the vertices are ordered in such a way that each Ri C R a.s 
well as R itself form one interval, and so does each Cj C C and C; vertex a is in 
the middle and adjacent to the last vertex rn^k +4 of R and the first vertex ci^ 
of C, as depicted below. 

i?l R2 Rn C2 Cji 

^1,1 ■ ■ ■ ^2,1 ■ ■ ■ ; ^2,s ■ ■ ■ ^n,l ■ ■ ■ ^n,s ^ ^1,1 ■ ■ • ^2.1 • ■ ■ ^2,t ■ ■ ■ ^n,l ■ ■ ■ ^n,t 

R C 

Because each R^ under labeling L forms a single interval, for each column j, 

N{L,P{C,))=N{L, U R,) = N{n,{i\M[i, 3 ] = l}) 

= the number of consecutive I’s blocks in the j’th column of M 
^ k. 

For a similar reason and because of the symmetry of M, for each row i, we 
have 



N{L,r{R,))=N{L, U C,) = N{7T,{j\M[t,j] = l}) 

M[i,j] = l 

= the number of consecutive I’s blocks in the Tth row of M 
^ k. 

Concerning the arc labeling I for all-shortest-path IRS, on each arc e = (u,v), 
there are only two alternatives, setting I{u,v) to S{u,v) or to S{u,v) + u. We 
make the choice for different arcs that would guarantee each arc receiving no 
more than A: -|- 1 intervals. 

At vertex a: Let I{a,ny) = S{a,r^ 3 ) = {rty} and I{a,Cj^h) = S{a,Cj^h) = 
{cj^h}- Obviously they receive one interval. At other vertices, we will use the 
inner case. 

At vertex Vi^: On arc {r^y,a), let I{riy,a) = S{riy,a) + {r^^} = {a} -I- i? -I- 
C-r{Ri) = V -r{Ri); then N{L, I{riy,a)) < A(T(i?i)) + 1 ^ k + 1. On 
arc {riy,Cj^h), let I{riy,Cj^h) = S{riy,Cj^h) + {riy} = {cj,h} + r{Cj)\ then 
N (L, I{rig, Cj^h)) ^ 1 + kd{r{Cj)) ^ A: -|- 1. 
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At vertex Cj^h- Similarly for the last case, let I{cj^h, a) = S{cj^h, a) + {cj^h} = 
V - r{Cj) and let = S{cj^h,n,i) + {cj^h} = {ri,i} + r{Ri); then 

both kinds of arcs receive at most k + 1 intervals. 

The only i/part of the lemma is proved. 

Suppose conversely that G admits an all-shortest-path (k+l)-IRS, {L,I). 
Consider a permutation tt on {1, 2, . . . , n} induced from L in such a way that 

Tr{i) < 7r(j) <1=^ mm{L{v)\v € Ri} < min{L(u)|r; € Rj}- 

We need only to justify that, for every 1 < j ^ n, N{-K,{i\M[i,j] = 1}), the 
number of blocks in the j’th column of M when its rows are permuted using tt, 
is no more than k. 

For 1 ^ I ^ n, let mini?i denote the minimum vertex in Ri, i.e., L(minRi) = 
mm{L{riy)\riy G R^}. If M[fy,j] = 1 and M[z 2 ,j] = 1, and in \JMli,j]=i^i 
under L, the two vertices mini?,^ and min Ri^ belong to the same interval, then 
in the matrix M permuted according to tt, the two rows i\ and fy must belong 
to the same block of consecutive I’s. Thus we have N{TT,{i\M[i,j] = 1}) ^ 
Ri) = N{L, r{Cj)). So we need only to prove that for every 
1 < j ^ n, N{L,r{Cj)) < k. 

Let us first show that for any column j, N{L, r{Cj)) ^ fc + 3. Otherwise, 
there would be some j such that N{L, r{Cj)) > k + 3. Let us consider the 
interval number N{L,I{riy,Cj^h)) on an arc (riy,Cj^h) at a vertex rij € R{Cj). 

is either S{riy,Cj^h) or S{ny,Cj^h) + {n,i}- If it is S{ny,Cj^h), then 

N{L, = N{L, {c,, 4 + T(Q) - Kfy) ^ N{L, r{C,)) - 2 > fc + 1. 

If it is S{riy,Cj^h) + {n,i}, then 

N{L,I{r,y,Cj^h)) = N{L, {c,. 4 + r(Cfy) ^ N{L,r{Cj)) - 1 > fc + 2. 

Both cases imply that under the labeling L the edge {riy,Cj^h) must receive more 
than fc + 1 intervals, contradicting that (L, I) is a (fc + 1)-IRS for G. 

Now, we prove that for any column j, N{L,r{Gj)) ^ fc. Supposing that it 
is not the case, then there must be a j such that N{L, FiCj)) ^ fc + 1. In this 
case, we can show that there exists an riy € r(Cj) and a Cj^h € Gj such that the 
arc {ri i,Cj^h) at vertex Viy receives at least fc + 2 intervals, contradicting that L 
is a vertex labeling of the (k+1 )-IRS. 

Since \Gj \ = 2n(fc + 4) + 1 = 2|i?| + 1 ^ 2|T(Cj)| + 1, and each riy G R{Cj) is 
adjacent to no more than two vertices of Cj (with respect to the order defined by 
the labeling L), there must be a vertex Cj^h G Cj such that L{cj^h) is adjacent to 
no labels of the vertices in r{Cj). In the set S{rij, Cj^h) = {cj,h\ + r{Cj) — {rij}, 
under the labeling L, Cj^h itself has to form a single interval. If I{rij,Cj^h) is 
Cj^h) + then the interval number of arc (r^^/, Cj^h) is 

N{L, /(r,,p c,, 4) = N(L, {c,, 4 + r(Cfy) = 1 + N{L, r{Cj)) ^ fc + 2, 

which is a contradiction. If I{rij,Cj^h) is S{rij,Cj^h), by further selecting an 
appropriate ; from r{Cj) we can arrive at another contradiction. Because 



276 



Rui Wang, Francis C.M. Lau, and Yan Yan Liu 



there are at least /c + 4 vertices in r{Cj) and all of the vertices of r{Cj) are 
distributed among the N{L, r{Cj)) ^ fc + 3 intervals, there must be an interval 
[L{ri^i),L{rii^i')] of r{Cj) under L having at least two vertices. Selecting the 
boundary vertex in this interval we have N{L, r{Cj) — {rij}) = N{L, r{Cj)). 
Hence the number of the intervals assigned on edge is 



N{L,I{n,i,Cj,h)) = N{L,{cj,h} + r{Cj) - K,}) 

= 1 + N{L, r{Cj) - {nj}) = 1 + N{L, r{c,)) 

^ /c + 2, 

which again is a contradiction. The z/part of the lemma is proved. □ 

It is easy to see that the above transformation can be performed in polynomial 
time. By the NP-completeness of fc-CBS for every fixed k ^ 2, the next theorem 
follows. 

Theorem 1. Given a network G, the problem of deciding if there exists an all- 
shortest-path k-IRS for G is NP-complete for every fixed A: ^ 3. 

Note that, in the only z/part of the proof of Lemma 1, the {k + 1)-IRS for G 
derived from the permutation of the rows of M is linear. Thus, the following 
holds. 

Theorem 2. Given a network G, the problem of deciding if there exists an all- 
shortest-path k-LIRS for G is NP-complete for every fixed A: ^ 3. 

In the transformation, if we let each Ri contain 2A: + 7 vertices and each 
Cj contain 2n{2k + 7) + 1 vertices, we will be able to find one of the intervals 
of P{Cj) under L containing at least three vertices. Selecting an intermediate 
vertex from such an interval, a vertex rig € r{Cj) can be found satisfying 
N{L,r{Cj) — = N{r{Cj)) + 1. Note that in all-shortest-path SIRS, the 

arc labeling L is identical to S. By a similar argument one can show that there 
exists an all-shortest-path strict {k -I- 2)-IRS for G if and only if there exists a 
permutation for M leading to at most k consecutive I’s blocks per column. 

Theorem 3. Given a network G, the problem of deciding if there exists an all- 
shortest-path k-SIRS (k-SLIRS) for G is NP-complete for every fixed k ^ 4. 



4 Discussions 

We have proved that to recognize networks that admit all-shortest paths A:-IRS 
(A:-SIRS) for every A: ^ 3 (A: 4) is NP-complete for unweighted graphs (and 

of course also for weighted graphs) . Our transformation takes advantage of the 
symmetry of the matrix in the instance of the NP-complete problem A:-CBS. For 
general binary matrices. Booth and Lueker [1] gave a linear algorithm for A; = 1; 
Goldberg et al. [8] proved NP-completeness for every fixed k ^ 2; Flammini et 
al. [4] showed the same for general k even if the matrices are restricted to each 
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row having no more than k blocks of consecutive I’s (although that is not stated 
explicitly in their paper). 

Note that if we apply our transformation to an arbitrary binary m by n ma- 
trix, a graph will be constructed such that the matrix has no more than k blocks 
in each column and no more than I blocks in each row if and only if the con- 
structed graph supports an all-shortest-path (max{/c, ^}-|-l)-IRS ((maxjfc, l}+2)~ 
SIRS). Thus, the transformation can start from an instance of Flammini’s NP- 
complete problem in [4] to prove the NP-completeness of all-shortest-path k-IRS 
(and its variants) for general (but not constant) integer k. 

The results of this paper clearly imply that the optimization problem of 
determining the minimal k such that a given network supports an all-shortest- 
path fc-IRS (or its variants) is NP-hard. They also imply that we cannot in 
polynomial time approximate the compactness of IRS (SIRS) within a ratio of 
less than 4/3 (5/4), unless P=NP. 

The stretch factor of a fc-IRS i? on a graph G is defined as the smallest real s 
such that for every pair of vertices, x, y, the length of the routing path induced 
by R from x to y is at most s times the distance in G between x and y. Since 
the construction in our transformation is a graph of diameter 2, it follows that 
if PyfNP then for every /c ^ 3 (A: 4) we cannot in polynomial time determine 

if a given graph supports an all-path fc-IRS (fc-SIRS) of stretch factor less than 
3/2. 

Some remaining open problems: What is the time complexity when fc = 1,2 
(fc = 2, 3) for all-shortest-path fc-IRS (fc-SIRS)? 
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Abstract. We study an on-line parallel job scheduling problem, where 
jobs arrive over list. A parallel job may require a number of machines for 
its processing at the same time. Upon arrival of a job, its processing time 
and the number of requested machines become known, and it must be 
scheduled immediately without any knowledge of future jobs. We present 
a 8-competitive on-line algorithm, which improves the previous upper 
bound of 12 by Johannes [8]. Furthermore, we investigate two special 
cases in which jobs arrive in non-increasing order of processing times or 
jobs arrive in non- increasing order of sizes. Better bounds are shown. 

Keywords: On-line scheduling, parallel computing 



1 Introduction 

In this paper we study an on-line problem of scheduling parallel jobs. Parallel jobs 
may require more than one machine simultaneously. They are characterized by 
two parameters, processing time and size (the number of requested machines). 
Jobs {Ti, J 2 , • • • , >/n} arrive over list, which are not known in advance. Upon 
arrival of job J^, its processing time and the number of machines required become 
known, and it must immediately and irrevocably be scheduled. The next job 
Ji+i appears only after job Ji has been assigned. The goal is to minimize the 
makespan, i.e., the largest completion time over all the jobs. 

The model of scheduling parallel jobs has recently been studied extensively, 
see, e.g., [2]-[4], [10] and [11]. In many applications, a network topology is con- 
sidered, in which jobs can only be executed on particular machines. Only those 
machines connected may execute a job together simultaneously. Parallel ma- 
chines with a specific network topology can be viewed as a graph where each 
node represents a machine and each edge represents the communication link 
between the two nodes. In this paper we focus on the parallel system, called 
PRAM, whose underlying network topology is a complete graph. 
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On-Line Models. We can distinguish on-line scheduling problems by the man- 
ner that jobs arrive. In general, there are three different versions. 

1. Jobs arrive over list. Jobs arrive one by one. The next job appears only after 
the current one has been scheduled. Upon arrival of a job its size is known. 

2. Jobs arrive over time. Each job Ji has a release date which is not known 
before Ji appears. Upon arrival of job Ji its size Si becomes known. 

3. Jobs arrive on dependencies. Each job appears only after all its predecessors 
are completed, i.e., some special order in which jobs have to be started (they 
can be expressed in term of chains, trees, series parallel orders, interval order 
and so on). As soon as a job appears its size is known. However, we are not 
aware of the dependencies among jobs in advance. 

For each of the above three versions, upon arrival of a job, we may also assume 
that the job processing time becomes known (called known processing time) or 
unknown until they are completed (called unknown processing times). In this 
paper we investigate the model that jobs arrive over list with known processing 
times. 

Known Results. For the model where jobs arrive over time, Naroska et al. [11] 
presented a list scheduling heuristic with competitive ratio of 2 — 1 /m for un- 
known processing times, and pointed out it is optimal. The model where jobs 
arrive on dependencies was first studied by Feldmann et al. [4]. With unknown 
processing times, it is shown that the best competitive ratio is m, the num- 
ber of machines, even if the precedence is known in advance. Bischof et al. [1] 
considered the problem that the job processing times are equal and present a 
2.7-competitive algorithm and a lower bound of 2.691. 

Regarding the model where jobs arrive over list, to our best known, it was 
only studied by Johannes [8]. A 12-competitive algorithm was presented and a 
lower bound 2.25 was proved with an enumerating program. 

Our Contribution. In this paper, we concentrate on the model where jobs 
arrive over list and improve the upper bound given by Johannes [8]. We present 
an on-line algorithm with competitive ratio of at most 8. A lower bound of 6 is 
proved for this algorithm. Then we study two special cases, in which we know 
some a priori information about the job sizes or job processing times. The first 
case assumes that jobs arrive in non-increasing order of processing times. We 
obtain an upper bound of 2 by employing a greedy algorithm, which schedules 
jobs as early as possible. Note that if all job processing times are equal to one, 
it becomes the classical bin packing problem, for which the best known lower 
bound and upper bound are 5/3 [13] and 7/4 [12], respectively. Then we deal 
with the second case that jobs arrive in non-increasing order of sizes. We show 
that the greedy algorithm has a competitive ratio of at most 2.75. If all job sizes 
are equal to one, it is the classical on-line parallel machine scheduling problem, 
for which the best known lower bound 1.85358 and upper bound 1.9201 were 
given by Gormley et al. [7] and, Fleischer and Wahl [6], respectively. 
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Organization of the Paper. The remainder of this paper is organized as 
follows. Section 2 gives preliminaries. Section 3 presents the on-line algorithm 
with competitive ratio of at most 8. The two special cases are dealt with in 
Sections 4 and 5. 

2 Preliminaries 

An instance / of our problem consists of a list of parallel jobs { Ji, J2, • ’ ’ j Jn} 
and m identical machines. Each job Jj is characterized by size Sj, the number of 
required machines, and processing time pj. Once job Jj appears, its size Sj and 
processing time pj become known. We use (sj,pj) to denote job Jj. The efficiency 
of a schedule at any time t is defined to be the number of busy machines at time 
t divided by m. The average efficiency in disjoint intervals (oj, bi), i = 1, . . . ,1, 
is thus defined by 

i ^ 

> — 7- • / ( total number of busy machines at time t) dt. 

-a, J ^ ’ 

at 

We adopt the standard measure competitive ratio to evaluate an on-line al- 
gorithm. For any instance /, let Ca{I) and C*{I) be the makespan given by an 
on-line algorithm A and the makespan produced by an optimal off-line algorithm, 
respectively. The competitive ratio of algorithm A is defined as 

Ra = smp{Ca{I)/C*{I)}. 

I 

Algorithm A is called p-competitive if Ca (I) < pC* (/) holds for any instance I. 
A lower bound i? of a problem means that any on-line algorithm has competitive 
ratio of at least R. 

When job Jj is coming, one can schedule it in a time interval which has a 
duration at least Pj and during which the number of idle machines is at least Sj. 
The simplest on-line algorithm is the following greedy algorithm. 

Algorithm Greedy. As a job is coming, schedule it as early as possible. 

As shown in [8] algorithm Greedy has a competitive ratio of m. The bound 
can be obtained with the following example of 2m jobs. Let e > 0 be a sufficiently 
small number. The (2z — l)-stjob is (l,m-|-(i — l)£), while the 2i-thjob is (m,e), 
i = 1, . . . ,m. Algorithm Greedy processes the job one after another and gives 
a makespan of mf + m{m — l)£/2 -I- me. However, one can first process the m 
jobs of size m and then process the m jobs of size 1 each on a machine. It gives 
a makespan of to -I- (to — 1)£ -I- me. The competitive ratio goes to to as £ tends 
to zero. 

The example tells us that it is not wise to be so greedy. To improve the 
bound we may leave some space for the future jobs in case that the current 
schedule is not in a “tight” manner. Johannes [8] designed an on-line algorithm 
alone this line, in which the time axis is partitioned into intervals. The length 




282 Deshi Ye and Guochuan Zhang 



of the first interval I\ depends on the processing time of the first job, and the 
next intervals li always have a double length of the interval h-i- The jobs are 
scheduled in different way in some intervals. It was shown that the competitive 
ratio of the algorithm is at most 12. Note that in her algorithm the partition 
of the time interval is almost independent on the current schedule. In the next 
section we propose an improved algorithm which partitions the time axis into 
intervals relying on the current schedule. We prove that the proposed algorithm 
has a competitive ratio in between 6 and 8. 

3 An Improved On-Line Algorithm 

A job is called big if its size is larger than m/2, otherwise it is called small. No 
two big jobs can be processed at the same time. To present the algorithm more 
clearly we show first the structure of the schedule given by the algorithm. The 
schedule looks like an aligned house, which consists of rooms and walls. Rooms 
and walls appear alternatively. The rooms are constructed by small jobs, while 
the walls are constructed by big jobs. A big job is assigned to an old wall or 
builds a new wall by itself. A small item is always assigned to a room. The 
schedule starts from a wall and ends at a wall. We assume that there is a wall 
at time zero with a length (thickness) of zero. In case that the last room has no 
wall on the right side after all jobs are scheduled, we add a dummy wall with 
length of zero starting at the completion of the last completed job. Therefore 
the schedule can be described as 



Wi U i?i U W 2 U i?2 U • • • U Wn U U TTjv+i, 

where Wi denotes the i-th wall and Ri denotes the z-th room. The length of wall 
Wi is the total processing time of the big jobs involved in the wall. The length 
of room Ri is the distance between two walls, more precisely, the time difference 
between the starting time of the wall Wi+i and the ending time of the wall Wi. 
The length of the schedule is the total length of the rooms and the walls. 

A wall is open if there are no small jobs scheduled after it and a room is 
called open if there is no wall on the right side. 

Algorithm DW (Dynamic Waiting) 

1. Create a wall {W\) of length zero at time zero. 

2. If the incoming job Ji is big, i.e., Si > m/2, schedule Ji immediately after 
the open wall if such a wall exists. If there is no open wall, then there must 
be an open room. Let h be the length of the interval of efficiency less than 
1/2 in the open room and let L be the current length of the schedule. Start 
job Ji at time oi L + h. 

3. If the incoming job Ji is small, i.e., Si < m/2, find the first room which can 
accommodate Ji and schedule Ji as early as possible. If such a room does 
not exist, we create a new room for Ji following the present latest wall. 

4. If no job comes, add a dummy wall if needed and stop. 
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An Example: There are 10 machines and 8 jobs, where J\ = (4, 1), J2 = (3, 3), 
J3 = (6,1), J4 = (1,4), J5 = (1,6), Je = (1,7), J7 = (7,1), Jg = (6,1). The 
schedule by algorithm DW \s shown in Fig. 1, where Wi = [0,01, = [0,5], 

W 2 = [5,6], i?2 = [6,20] and W 3 = [20,22]. 




Fig. 1. An illustration of algorithm DW 



Lemma 3.1. Let S be a schedule by an algorithm A. If the length of the schedule 
S can be divided into two parts, one with average efficiency no less than a, 0 < 
a < 1, and another with total length of at most (3 times the optimal makespan, 
then the competitive ratio of algorithm A is at most 1/a + (3. 

Proof. Denote by x the length of the part with average efficiency no less than 
a and denote by y the length of another part. Thus Ca = x + y. Note that 
C* > ax and y < f3C* . We have Ca = x + y < {1/a + f3)C* . □ 

Theorem 3.2. The competitive ratio of algorithm DW is at most 8. 

Proof. Let C be the final schedule by algorithm DW, i.e., C = Wi U U • • • U 
Wn U Rn U Wn+ 1 - By the algorithm, all rooms consist of small jobs. We divide 
room Ri into two time intervals Ei and Fj, where in interval Ei the efficiency of 
the schedule is at least 1/2, and Ei = Ri — Ei. Since Ri consists of small jobs, it 
is easy to see that Ei precedes Ei. 

The idea to prove the bound of 8 is partitioning the time interval of the 
schedule C into two classes. In the first class the average efficiency of the intervals 
is at least 1/4 and in the second class the total length of the time intervals is 
bounded above by 4 times the longest job processing time. The theorem thus 
follows from Lemma 3.1. 

Note that the efficiency of a wall is larger than 1/2. We only need to consider 
the rooms. If all Fj = 0, the efficiency of the schedule at any time is at least 1/2, 
which implies that the competitive ratio is at most 2. 

Assume that Ffc 0 and Fj = 0 for all j < k. li k = N, all intervals but Fjv 
have efficiency at least 1/2. During the time interval Fjv each machine executes 
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at most one job. Otherwise the last job can start earlier since the efficiency of the 
schedule is less than 1/2 and all jobs in the room are small. By the algorithm, 
the length of is at most twice the longest job processing time. It proves that 
the competitive ratio is at most 4 by Lemma 3.1. 

Now we assume that k < N. Note that all time intervals before Fk have 
efficiency of at least 1/2. We consider the rooms Rj, j = k, . . . ,N. For any two 
adjacent rooms Ri and Rj {j > i), the jobs scheduled in Rj must have processing 
time larger than |Fj|, the length of Fi. Otherwise, they could have been assigned 
to room Ri. Therefore, either Ej = (h or \Ej \ > \Fi\. 

Find those Ej yf 0 for j > k+1. For each of such Ej, we have \Ej \ > |Fi|, for 
k < i < j. We want to determine some Fi (i < j) such that the average efficiency 
of Ej and Ei is at least 1 /4. Then the pair of Ej and Fi is called a match and such 
an Fi is called matched. Before we start this procedure, all Fj’s are un-matched 
for i > k. Start from the first non-empty Ej (j > k) and continue the following 
matching procedure until all non-empty Ej’s are matched. If \Ej\ > |Fj|, we put 
Ej and Fj as a match. Otherwise, \Fj \ > \Fi \ holds for all fc < i < j. We put Ej 
and the un-matched Fj with the largest index i < j together as a match, and 
thus Ei is matched. For each match, it is obvious that the average efficiency is 
at least 1/4. 

We do the matching interval by interval starting from Rk (we do not count 
Ek) until all nonempty Ej’s are matched for j > k. We will show that one of the 
following two assumptions must hold during the matching procedure. 

1. There is only one un-matched Fj. 

2. Re-index the un-matched Fj’s and denote them by Ff, F 2 , • • •. Then {Fj+i \ > 

mi 



Recall that Fj, yf 0. We start from intervals Rk. Clearly, the assumption 1 holds. 
Now we assume that one of the two assumptions is true when we deal with 
interval Rp (p > k). Now we consider interval i?p+i with the following two cases. 
Case 1. Epj .1 = 0. Consider the last un-matched Eq ^ % preceding interval 
Rp+\. The small jobs in interval Rp+i have processing times greater than |Fg| 
and the length of interval i?p+i is larger than twice of |F/| when the next wall 
Wp +2 is created. Then |F/+i| > 2\Fq\, where q < p. 

Case 2. F^p+i yf 0. If there is only one un-matched Fj so far, then either F/+i 
or Fj matches Fjp+i. Thus there is only one un-matched interval after i?p+i is 
considered. The assumption 1 still holds. If there are more than one un-matched 
Fj, without loss of generality, re-index them as F"i , F 2 , • • • , F/. By the assumption 
2, we have |F/+i| > 2|F/-|, 1 < j < g — 1. If |F/+i| > |Fjp_|_i|, then F/ and Fjp+i 
are matched and F/_|_i is un-matched. We have |F/_|_i| > |Fjj| > 2|Fjj_i| and 
now Epj_i becomes Fq. If |F/_|_i| < |ifp+i|, Fp+i and ifp+i are matched. The 
un-matched intervals remain unchanged, for interval Rp+i. 

Finally we get a list of un-matched intervals. Let x be the processing time 
of the longest job in the last un-matched interval Ft. Thus 2x > |Fj| by the 
algorithm. \^j\ < 2|Fj|. Since C* > x, we get that the total length of these 

intervals is bounded above by 4C*. Therefore the theorem follows. □ 
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In the following, we will give an instance to show that a lower bound of 
algorithm DW is 6. The instance consists of sequential jobs and block jobs, 
where a sequential job has a size of one and a block job has a size of m. 

Theorem 3.3. The competitive ratio of the algorithm DW is at least 6m/ (m + 
1) for m >2. 

Proof. Let A: be a sufficiently large integer and let e < 1/2^+^ be a small positive 
number. Let t = [(to — 2)2^'+^ /(to -I- 1)J. In the first t job groups Gi, - ■■ ,Gt, 
each consists of [to/ 2] identical sequential jobs followed by a block job. The 
processing time of each block job is e. The processing time of each sequential 
job in Gi is 1 — (t — i -I- l)e. 

The last group Gt+i of jobs consists of 2k jobs Ji, J2, • • • , J2fc, where J 2 i is 
a block job of processing time e and J 2 i-i is a sequential job of processing time 
2*“^ -I- (2*“^ — l)e, for 1 < i < fc. Clearly, P 2 i+i = 2p2i-i + e, for 1 < A < A: — 1. 
In the schedule generated by algorithm DW we have A -I- A: rooms and A -I- A: -|- 1 
walls (the length of the first wall is zero). The total length of the walls is (A-|-A:)e. 
The length of the room i is less than one, for z = 1, . . . , A, while the length of the 
A -I- z-th room is 2p2i-i, for z = 1, . . . , A:. Therefore, Gdw = A + 2^+^ — 2 + 0{e) 
(see Fig. 2, in which s denotes a sequential job and B denotes a block job). 
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Fig. 2. The schedule by algorithm DW 



In an optimal schedule, we can assign the sequential jobs of groups Gi, . . . , G* 
to TO — 2 machines, the sequential jobs Ji, J3 , . . . , J 2 k-z to a machine and the 
sequential job J 2 k-i to one machine. The block jobs are scheduled one by one 
following the sequential jobs (see Fig. 3). It shows that G* = 2^ + 0{e). 

Gdw/C* > (2'=+2-3-h(TO-2)2'=+V(TO-hl)-hO(e))/(2'=-hO(e)) ^ 6TO/(TO-hl) 

as k tends to infinity. □ 

Theorem 3.3 shows that an asymptotic lower bound of algorithm DW is 6. 

4 Non-increasing Processing Times 

In this section, we study the case that the jobs arrive in non-increasing order of 
processing times. Note that if all the job processing times are equal to one, the 
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Fig. 3. An optimal schedule 



scheduling problem is equivalent to the classical on-line bin packing problem. To 
see this, we can always assume that an on-line algorithm starts any job at an 
integral time. As far as we know, the best known lower bound and upper bound 
of classical on-line bin packing are 5/3 [13] and 7/4 [12], respectively. To make 
the paper self-contained, we give an instance to show the lower bound of 5/3. 
Let fc > 0 be a sufficiently large integer and let m = 6fc. There are three groups, 
each of which consists of 6 jobs. The jobs of first group Ii are of {k — 2, 1), the 
jobs of the second group I2 are of {2k + 1,1), and the jobs of the last group I3 
are of (3/c -h 1, 1). Clearly, C*{I\) = 1, C*{I\ U 12 ) = 3 and C*(/i U /2 U I^) = 6. 
For any on-line algorithm A, it is easy to verify that 

_ i'^.4(7i) C'a(/iU/2) U /2 U /a) ^ 

c*{h u I2) ' c*{h \JhyJhY - ' ■ 

In the following, we analyze algorithm Greedy for the special problem. Note 
that Greedy is just FF {First Fit) bin packing algorithm if all job processing 
times are equal. It is known that the (absolute) competitive ratio of FF for bin 
packing is in between 1.7 [9] and 7/4 [12]. 

We distinguish jobs as big ones and small ones as in the previous section. 

Theorem 4.1. The competitive ratio of algorithm Greedy for the on-line par- 
allel jobs scheduling with non-increasing processing times is at most 2. 

Proof. Denote by C the time interval of the final schedule by Greedy. We par- 
tition it into parts according to the efficiency. Let C—Bi, Li, B2, ■ ■ ■ , Lfe, Bk+i, 
where Bi is the continuous time interval, at anytime of which the efficiency is 
larger than 1/2, and Li is the continuous time interval, at anytime of which 
the efficiency is at most 1/2. To avoid trivial cases we assume that Li 0, for 
i = 1, . . . , fc and Bi 0, for i = 2, . . . ,k. Note that it is possible that Bk+i = 0. 
The jobs processed during an interval Li = [ai,bi] are small. No jobs start at 
any time ti G {ai,bi). If such a job exists, it would have been scheduled at time 
Qi since at that time there are at least m/2 machines available. It shows that the 
length of each Li is at most the longest processing times among all small jobs. 
Thus C* > \Li\, for f = l,2,...,fc. 
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If Bi = 0, any job scheduled after Li is big (with a size larger than mj2); 
Otherwise, it could have been assigned to interval Li. Let T be the total length 
of the (big) jobs scheduled after Li. C* > T and C* > |Li|, while Coreedy = 
\Li \ + T. It follows that the competitive ratio of Greedy is at most two. In the 
following, we assume that i?i yf 0. 

Consider L^, 1 < i < k. i?i+i must start with a big job, since, otherwise, a 
small job can be scheduled in Li. We denote the big job by Jy with processing 
time of pb- The small jobs scheduled in arrive after Jy; Otherwise it would 
be scheduled in Li since at that moment at least m/2 machines are idle. The 
processing time of any small job in is at most pt but larger than \Li\. It 
shows that pb > \Li\. Then the average efficiency over the intervals Li and Bi-\.\ 
is at least 1/2. 

Now we consider the interval Lj, together with Bi. Denote the longest job in 
Lk by Jn with processing time p„. Thus </„ is a small job and \Lk\ < Pn- At the 
moment that J„ comes, J„ can not be assigned to B\ . Note that the jobs arrive 
in non-increasing order of processing times. The jobs, which have been assigned 
to start at time zero before comes, have processing times at least p„. Then 
\B\ \ >Pn> \Lk\- We have the average efficiency over Bi and Lk of at least 1/2. 

Combining all the cases considered above, we conclude that the average ef- 
ficiency of the schedule is at least 1/2, which implies the competitive ratio of 
algorithm Greedy is at most 2. □ 

5 Non-increasing Job Sizes 

In this section, we deal with the case that jobs arrive in non-increasing order 
of sizes. The classical on-line scheduling problem which jobs require exactly one 
machine is a special problem of our case, if we assume that the job sizes are equal 
to one. To our knowledge, the currently best lower bound and upper bound are 

1.85358 [7] and 1 -I- < 1.9201 [6], respectively. In the following, we first 

analyze algorithm Greedy^ which is identical to LS (List Scheduling) for the 
classical schedule problem. We show that it is 2.75-competitive and present a 
lower bound of 2.5. 

Theorem 5.1. Algorithm Greedy is 2.7b -competitive. 

Proof. Consider the schedule generated by algorithm Greedy. Let x be the length 
of the schedule during which a big job is processed. Then C* > x. Denote by t 
the starting time of the last job Jn with processing time which determines the 
makespan. lit = x, we get that Ccreedy < 2C* . Consider the time interval [x, t). 
The jobs starting in this interval are small. If the efficiency at some time g in [x, t) 
is less than 2/3, any job starting at or later than g must have a size larger than 
to/ 3. Then the size of any earlier job starting in the time interval [x, g) is larger 
than to/ 3 too. It shows that two such jobs can be scheduled together and the 
efficiency must be larger than 2/3. It is a contradiction. Therefore, the efficiency 
in the interval [x, t) is at least 2/3. Let y = t — x. We have G* > x/2-\- 2y/3 and 
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C* > Pn- On the other hand, C Greedy = x + y + Pn- The theorem is proved with 
the following cases: 

Case 1: y < jx. In this case, Coreedy < jx + Pn < ^C* . 

Case 2: y > |x. In this case, jC* > ^x + ^y = x + y + ^{y — jx) > x + y. 
Combining with C* > Pn, we get C Greedy □ 

Theorem 5.2. The competitive ratio of algorithm Greedy is at least 2.5. 

Proof. Let /c > 0 be a sufficiently large integer. Let to = 3 • 2^ and N = 2^“^. 
We are given 6 job groups {Gi, G 2 , • ■ • , Ge}. Group Gi consists of N jobs of 
(3 • 2^=-i + N, 1), (3 • 2'=-! + IV - 1, 1), • • • , (3 • 2'=-! + 1, 1). G 2 consists of two 
identical jobs of (3 • l/N). G 3 has N jobs of (3 • 2'="i - 1, 1/N), (3 • 2^=-i - 

2, 1/A^), • • • , (3-2^“^ — Y', 1/fV). G 4 has N identical jobs of (2^ + 1, 1). G 5 consists 
of three identical jobs of (2^, 1). Finally, Gq has only one job of (l,iV + 1/iV). 
Clearly the jobs arrive in non-increasing order of sizes. 

By algorithm Greedy, the jobs in Gi are processed one by one. Then the two 
jobs in G 2 start at time N. The i-th job in group G 3 starts at time N — i, for 
1 < i < N. In the time interval [0,7V], the longest idle time of the machines is 
1 — 1/A^. Then the G 4 jobs are scheduled in the interval [7V-|- 1/A^, 3A^/2-|- 1/7V], 
where two G 4 jobs are processed together at each unit time. The three G 5 jobs 
start at time 3A^/2-|-l/7V. The last job (from Gq) can only start at 37V/2-I-1-I-1/7V. 
Thus GGreedy = 57V/2 -|- 1 -|- 2/N (scc Fig. 4). 




Fig. 4. The schedule by algorithm Greedy 



However, we can schedule jobs of Gi, G 4 and Ge by time N + 1/N. Each 
two of the jobs of size 1/N (G 2 and G 3 jobs) can be processed at the same time. 
The three jobs of G 5 are scheduled together at time A^-|-2/iV-|-l. It implies that 
G* < TV -b 2/iV -b 2 (see Fig. 5). Thus, 

GGreedy/C* > 5/2, 



as k tends to infinity. 



□ 




On-Line Scheduling of Parallel Jobs 289 





The following semi greedy algorithm can be shown to have a competitive 
ratio of 2.5. Due to the page limit we omit the proof. 

- Schedule the big jobs (of size larger than m/2) one after another. Let b be 
the largest completion time among these jobs. 

- Schedule the medium jobs (of size at most m/2 but larger than m/3) from 
time b with algorithm Greedy. 

- Schedule the small jobs (of size at most m/3) with algorithm Greedy. 

Note that this algorithm differs algorithm Greedy in that it does not schedule 
medium jobs together with big jobs even there are a sufficient number of available 
machines. 

Final Remarks. There is still a big gap between the upper bound and the 
lower bound for the general case. Although we feel that the lower bound 2.25 [8] 
is far away from the tight bound it seems quite difficult to improve it. We can 
construct a bad example for any on-line algorithm but not able to find a good 
bound due to the high computation time cost. It might be possible to improve 
our upper bound by refining the algorithm DW . But the analysis will get much 
more complicated. The most interesting question is the average case analysis for 
the problem. In this sense the greedy algorithm would perform much better. 
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Abstract. In this paper we study the range assignment problem in 
static ad-hoc networks on metric spaces. We consider the /i-strong con- 
nectivity and h-broadcast problems on trees, high dimensional Euclidean 
spaces and general finite metric spaces. Both homogeneous and non- 
homogeneous cases are explored. We show that the h-broadcast problem 
is polynomial solvable on trees and present an 0(n^)-approximation algo- 
rithm for the /i-strong connectivity problem on trees, where n is the num- 
ber of stations. Furthermore, we propose a probabilistic 0(log n log log n)- 
approximation algorithm for the h-broadcast problem and a probabilistic 
0(n^ log n log log n)-approximation algorithm for the /i-strong connec- 
tivity problem on high dimensional Euclidean spaces and general met- 
ric spaces. In the case of high dimensional real normed spaces, if the 
distance-power gradient a < 1 + 0(log log log n/ log log n), an 0(log“ n)- 
approximation algorithm and an 0{n^ log“ n)-approximation algorithm 
are developed for the /i-broadcast problem and the /i-strong connectivity 
problem, respectively. They are the first algorithms for the range assign- 
ment problem in static ad-hoc networks on general metric spaces. And 
the approximation ratio of O (log n log log n) for the /i- broadcast problem 
on general metric spaces is close to the known lower bound O(logn) [19]. 



1 Introduction 

Nowadays the ad-hoc wireless network based on multi-hop plays an increasingly 
important role in communication [17]. In static ad-hoc networks, the locations 
of stations are fixed. Typical applications of the static ad-hoc network model 
include emergencies, battlefield, monitoring remote geographical region and so 
on. Two of the main benefits of multi-hop ad-hoc networks are the reduction 
of the power consumption and the ability to vary the power used in the trans- 
mission such that the interference problem can be avoided. Minimizing power 
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consumption is crucial on such networks, since the wireless devices are portable 
and benefit only of limited power resources. And the range assignment problem 
is a key issue to minimize the power consumption, which has been extensively 
studied in wireless network theory. 

The problem is defined as follows. We are given a set S of stations (radio 
transmitter/receivers) on a metric space, which can communicate with each other 
by sending/receiving radio signals. In general message communication happens 
via multi-hop transmission, i.e., a message is sent to its destination through some 
intermediate stations. A range assignment for S' is a function r : S ^ IR”'’, which 
yields a directed communication graph Gr = (S,E), such that for each pair of 
stations u and v there exists a directed edge {u, u) G if if and only if u is at a 
distance at most r{u) from u. The cost (overall power consumption) of a range 
assignment r is defined as cost{r) = c(u)(r(u))“, where c(u)(r(u))“ is the 

cost of station v and c{v) is an coefficient depending on the station. When c{v) 
is a constant for all v G S, we call it the homogeneous case. Otherwise it is the 
non-homogeneous case. The distance-power gradient a is usually a real number 
in [1, 6] in practice. The goal of the problem is to find a range assignment r for S 
such that Gr satisfies a given property II and cost(r) is minimized. The detailed 
description of the problem can be found in [19]. 

Usually the property 77 is one of the follows: 

— h-strong connectivity, from every station s to any other t, Gr must contain 
a directed path of length at most h. 

— h-broadcast: Gr must contain a directed source spanning tree rooted at a 
given source s with depth at most h. 

Previous researches mainly focus on the problems defined on one dimen- 
sional spaces, i.e., networks that can be modelled as sets of stations located 
along a straight line. Polynomial time algorithms by dynamic programming were 
addressed for both homogeneous and non-homogeneous cases for 7-broadcast 
problems on one dimensional spaces in [16, 10, 1]. For the 7-strong connectivity 
problem an algorithm of dynamic programming for homogeneous case to obtain 
an optimal solution running in 0{n^) time was presented in [16] when h = n — 1 
is fixed. Unfortunately, for general h G {l,...,n — Ijitis still open whether 
the 7-strong connectivity problem on one dimensional spaces can be solved in 
polynomially. dementi et al. in [7] provided a 2-approximation algorithm for 
any 7 > 1 for homogeneous case, which runs in 0(7n^) time. There are only few 
results in the case of more general spaces. When 7 = n — 1 is fixed, minimum 
spanning tree based algorithms can deliver a solution with constant approxima- 
tion ratio for the broadcast problem [6], and the approximation ratio is 2 for 
the strong connectivity problem [16] for the case of high dimensional Euclidean 
spaces. In [9], the 7-strong connectivity problem was proved in AwAVX for any 
fixed 7 > 0 and the problem is AT^fb-hard on d dimensional Euclidean spaces for 
d > 3. The lower bound for the 7-broadcast problem on general metric spaces 
is O(logn) [19]. Up to our knowledge, there is no approximation algorithm for 
the range assignment problem on general spaces with a general 7 > 1 with a 
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bounded or input size dependent ratio. Some results about restricted version on 
high dimensions can be found in [8, 19]. 

However, in real application the problem with h = n— 1 is not practical as too 
many hops can reduce the quality of communication and also be hard to control. 
Thus we need to consider the general case that multi-hop h S — 1}. 

In addition, the problem on only one dimensional spaces is far from the reality. 
Therefore we study the case of high dimensional Euclidean spaces. As far as 
we know, there was no bounded result for the general multi-hop h on high 
dimensional Euclidean spaces. Due to the wide applications of ad-hoc networks, 
it contains also stations in space or in ocean. In this case the model of Euclidean 
space is not applicable as the transmission distance (depending on environmental 
condition) is not equivalent to the Euclidean distance. Hence the model of general 
finite metric space must be introduced. 

In this paper, we first study the /i-broadcast and /i-strong connectivity prob- 
lems on trees for both homogeneous and non-homogeneous cases. We show that 
an optimal solution to the /i-broadcast problem can be obtained in polyno- 
mial time by dynamic programming. For the /i-strong connectivity problem, 
an 0(n^ )-approximation algorithm is proposed. Based on these algorithms, we 
study the problems on high dimensional Euclidean spaces and general metric 
spaces. A probabilistic 0(lognloglogn)-approximation algorithm and a prob- 
abilistic 0(n^ log n log log n)-approximation algorithm are presented for the h- 
broadcast problem and the /i-strong connectivity problem, respectively. The ra- 
tios are further improved for special cases. The main technique we use here is the 
algorithm to approximate a finite metric space by a collection of tree metrics. 

1.1 Approximating Metrics 

Given a finite metric space induced by an undirected unweighted or weighted 
graph, we need to embed it in a simpler metric space such that the distance 
between each pair of vertices is approximately preserved. With this technique, 
some hard problems can be solved approximately because an arbitrary metric 
space may not have enough structure for exploring. 

In [2], Bartal proposed the idea of probabilistic approximation of metric spaces 
by a set of simpler metric spaces. Given a graph G = (E, E), let M be a finite 
metric space defined on V and cIm the distance function. For two metric spaces 
Ml and M 2 defined on V , we call M\ dominates M 2 if c^Mi (m, v) > c?M 2 (m, v) for 
any pair u,v € V. Suppose 5 is a collection of metric spaces on V. Assuming 
that each metric space in S dominates M, S is defined to p- probabilistically 
approximate M, if there is a probability distribution p over S such that the 
expected distance distortion between any vertex pair in M in a space chosen 
from S according to p is at most p. A polynomial time algorithm to 0(log^ n)- 
probabilistically approximate any metric space on |E| = n vertices by a collection 
of tree metrics was addressed in [2]. The approximation ratio was improved 
to O(lognloglogn) in [3]. However, the numbers of the tree metric spaces are 
exponentially large in both algorithms. Gharikar et al. [5] developed a polynomial 
time algorithm to construct a probability distribution on a set of 0(n log n) trees 
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metrics for any given metric space induced by a (weighted) graph G on n vertices, 
such that the expected stretch of each edge is no more than O(lognloglogn). 

To decide the probability distribution /i, a linear program with exponential 
number of variables has to be solved. Indeed this linear program is a packing 
problem, which can be solved approximately by some fast algorithms [5, 15,23]. 
We notice that in the algorithms above the triangle inequality is not required. 
Therefore an instance of the problem on a general metric space can be reduced 
to the case on a tree with a stretch at most O(lognloglogn) (See Section 3 
and 4). 



1.2 Approximation Algorithms for the Packing Problem 

The algorithms for the packing problem are employed to compute the probabil- 
ity distribution p. over the collection of trees described in Subsection 1.1. The 
packing problem is to compute x* G B such that A* = A(a;*) = min{A|/(a;) < 
A • l,x € B}, where f : B ^ is a vector of M continuous convex func- 
tions defined on a nonempty convex compact set B C IR^, and 1 is the vec- 
tor of all ones. The functions fm, ^ < rn < M , are the packing constraints. 
X = {x\, . . . ,xn) € B. Then the approximate packing problem is to compute 
X & B such that f{x) < c(l -I- e)A* • 1, where £ > 0 is the given relative error 
tolerance and c > 1 is the approximation ratio of the block solver, which is a 
given oracle to solve the block problem: to compute x = x{y) G B such that 
f{x) < c(l -I- t) min{y^/(z )|2 G B}, where the price vector y G P = {y G 
TR^\E^=iym = i,ym>0}. 

There are many approximation algorithms for the packing problem [12, 13, 
15,18,20,22]. However, here the linear program of approximating metrics has 
only a weak block solver, i.e., c > 1. In such a case only the algorithms in [15] and 
[18] (with generalization in [5]) can be employed. The numbers of iterations are 
bounded by 0(M(logM -|- log£“^)) and plog{Me~^)), respectively, 

where p = maxi<m<M niaxa,gB /m(a^) is the width of B. Here we will use the 
algorithms for the packing problem in [15, 18] to approximate a general metric 
space by tree metrics in Section 3 and 4 as [5, 23] . 

The paper is organized as follows. In Section 2 the (approximation) algorithms 
for the /i-broadcast problem and the /i-strong connectivity problem are presented 
and analyzed. Then the cases of high dimensional Euclidean spaces and general 
finite metric spaces are studied in Section 3 and 4. 

2 Algorithms for Problems on Trees 

We will explore the range assignment problem on edge weighted trees in this 
section. Both homogeneous and non-homogeneous cases are investigated. For 
the term distance d{i,j) we mean the shortest path distance between station 
i and station j. We will consider the /i-broadcast problem and the /i-strong 
connectivity problem separately. Without loss of generality, we only study the 
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non-homogeneous case and the results can be directly applied in the homoge- 
neous case. The main idea we use is dynamic programming, similar to [16, 1]. 

2.1 The h-Broadcast Problem 

We use similar definitions as in [1] here. Suppose that the set of stations S is 
connected by a tree T. Let s G S = {1, 2, . . . , n} be the source station. Given 
a specified pair i and j, 1 < i < j < n, three sets are defined as 

follows, for any station p G S, 

{ S"l , if d{i,p) < d{i,j) and d{j,p) > d{i,j); 

Sm , if d{j,p) < d{i,j) and d{i,p) < d{i,j); 

, otherwise. 

We consider two sets A, B G S = 5^^ U U and An B = Let rA,B 
be a minimum cost range assigned to stations in A such that for any station 
b G B, there exists a station a G A with rA,B{a) > d{a,b), where rA,B{<i) is the 
range assigned to a and d(a, b) is the distance between a and b in T. Denote 
by cost{A, B) the cost of r{A, B) and by cov{A, B) the set of stations such that 
for any u G cov(A,B), rA,B(a) > d(a,u) for some a G A. Finally, we denote 
MSB'lf {A^B) to be any optimal multi-source broadcast where the source set 
is A and the target set is B, in which every station in B is connected to some 
station in A with at most h hops, h G {1,2, •••,n — 1}. Let M^\A) be the 
cost of M SB^jf [A, S — A), for any A C S'. From above definitions, dynamic 
programming yields 

{ 0 , = 0 ' 

[So = U cov{S]^ , {u})}, otherwise, 

for all according to all pairs of i and j, and h G {1, . . . ,n — 1}. 

Theorem 1. The h-broadcast problem on trees can he solved in 0{hn'^) time. 

Proof. The correctness holds with the same argument as [16, Ij. We need to com- 
pute all tables . . . , for all pairs i, j G S. Table is directly com- 
puted at first. Then by dynamic programming we can compute all cost{S ]^ , {«}) 
in O(n^) time according to pair of i and j. With the pre-computed cost and 
we are able to obtain , where I G |l,...,/i}. This procedure takes 
0{n^) time. Overall the algorithm runs in 0{hn^) time and returns the optimal 
solution M^’^(S^®). 

2.2 The h-Strong Connectivity Problem 

For the station set S = (1, 2, • • • , n} connected by a tree T, we define a path set 
P{i,j) as the stations in the path from i to j, for any 1 < i,j < n. Notice that 
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\{P{i,j)\i,j G -S'}! = O(n^). For any path set P{i,j), the stations in it can be 
regarded as aligned on a line. Therefore for a given h, the algorithm in [7] for the 
homogeneous /i-strong connectivity problem can be applied here and generate 
a range assignment for P(i,j) with a cost at most twice of the optimum. This 
algorithm holds also for the non-homogeneous case [1]. For each I G {1, . . . , /i} 
and each pair 1 < i,j < n, we can compute a range assignment for all P(i,j) 
by the algorithm in [7] for one dimensional case by dynamic programming. This 
generates at most 0{v?) different tables, in which every table corresponds to a 
2-approximate range assignment for a specific path set P{i,j). 

Now we consider any station u G S'. Obviously u is assigned at most O(n^) 
ranges because u is in at most O(n^) path sets. Among them we assign the 
maximum range in the O(n^) candidates to u. Denote by rg the range assign- 
ment over S by the above rule. Moreover, we define rj to be the optimal range 
assignment over S. 

Theorem 2. There is an 0{n^)~ approximation algorithm for the h-strong con- 
nectivity problem on trees with a running time hounded by 0{hn^). 

Proof. Define r*{u) the range assigned to u G S in the optimal solution for S. 
For a fixed pair i and j, it is obvious that cost{rp(^ij)) < j) cost{r* {u)) 

as rp(^i j) is a 2-approximate range assignment for P{i,j). As showed before, 
there are at most O(n^) path sets. Hence costsum = '^i'^j^ost{rp(j_j'^) < 

2 E* Ej E„e P(i j) cost(r*(u)) < 0{n^)cost{r*g). Since in our algorithm, for ev- 
ery station we just take only the maximum range among the solutions over paths 
containing that station, it holds that cost{rs) = E« ™ax„gp(j cost{rp(^i j)) < 
costsum- Thus cost{rs) < 0{n^)cost{r*g). 

As for the running time, there are at most 0{n^) tables. For each table the 
2-approximation range assignment can be computed by dynamic programming 
in at most 0{hn^) time [7]. Therefore the total running time is 0{hn^). 

3 Algorithms for Problems 

on High Dimensional Euclidean Spaces 

We now turn to the d-dimensional Euclidean spaces, where d > 2. We will de- 
velop approximation algorithms for the /i-broadcast problem and the /i-strong 
connectivity problem based on the (approximation) algorithms on trees in Sec- 
tion 2. 

For a given station set S, we also denote by S' G M"* the space induced by 
a complete graph defined on station set S with distance function in M"*. The 
two-phase algorithm ACGe is as follows: In the first phase, a collection T of N 
tree metrics spanning S and a probability distribution /i on T are generated. 
Then we select a tree metric G T according to its probability Xi. In the second 
phase, the algorithms for the /i-broadcast problem and the /i-strong connectivity 
problem in Section 2 are applied for the corresponding problems on the selected 
tree metric T^. In this way we are able to obtain an approximate solution for the 
range assignment problem. 
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With the analysis in Section 2, for the selected tree metric Ti with a distance 
function dj’. , an optimal solution for the /i-broadcast problem and an 0(n^)- 
approximate solution for the /i-strong connectivity problem can be found, where 
n = |S'|. Thus we will focus on the first phase of ACQs, i.e., how to construct 
the tree metrics in T and the probability distribution /i. 

We first define an edge weighted complete graph Ms on the vertex set S. For 
each pair of vertices u, v G S, the length of edge (u, v) is Ims{u, v) = (ds(u, u))“, 
where ds is the distance function in original space S. It is obvious that the 
shortest path distance between u and v is bounded by the length of edge (u, v) 
in Ms, i.e., dMs{u,v) < Ims{u,v). In this case the cost that station u can cover 
V is cost{u,v) = dMs{u,v) = {ds{u,v))°‘ . 

Then based on the algorithm in [5] we can find the collection T, in which 
any tree Ti, i G {1, . ■ ■ , N}, induces a metric space that dominates Ms, i.e., for 
any two stations u and v, dTi(u,v) > dMs{u,v). The tree metrics in T will be 
employed to approximate the graph Mg. To generate a probability distribution 
over T, we can assign each T G T a, real number Xi G [0, 1]. Denoting by A the 
maximum stretch over all edges, simple observation yields the following linear 
program: 



min A 

dTi{e)xi < AdMs(e), for any edge e; 

Xi > 0 . 



Here the first set of constraints indicates that the expected length of the paths in 
tree metrics in T corresponding to edge e G Mg is no more than the maximum 
stretch of the length. And the other constraints are directly from the definition 
of probability distribution. This problem can be formulated as a packing prob- 
lem described in Subsection 1.2. Theoretically a linear program can be solved 
efficiently. However, here the number of variables in (1) (i.e., the number of trees 
in T) is unbounded and can be exponentially large. 

We use the algorithms in [15,18] to solve (1) approximately. Only a poly- 
nomial number of tree metrics will be generated (assigned with non-zero prob- 
ability). In the algorithms an initial solution is set at the beginning. Then the 
algorithms run the iterative procedure. In each iteration a pair of solutions to 
the packing problem and its dual problem is computed. There are following three 
steps in one iteration: First with the known solution, a price vector y related to 
the dual value is calculated to determine the direction of moving of the iterate. 
Then a block solver is called to generate an (approximate) block solution for y 
and the error tolerance e. Finally a new solution is obtained as a linear combina- 
tion of the known solution and block solution with an appropriate step length. 
When certain stopping rules are satisfied (which indicates that the duality gap 
is small enough to fulfil the accuracy requirement), the iterative procedure ter- 
minates and the desired approximate solution is returned by the last iteration. 
Details of the algorithm can be found in [15,23]. 

In order to complete the algorithm for (1), we need to consider the block 
problem, which is related to the dual problem. As showed in Subsection 1.2, 
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given a price vector y G P = {{yi, ■ ■ ■ ,ym)\J2T=iye = > 0}, the block 

problem is to find an x such that y"^ f{x) = mina;gB y^/(a;). By the definitions, 
the right hand side can be simplified as follows: 



N 



min 

x&B 



= min 

kG-B 



iny^/(x) = mm^ ( 2/e • ^ 



N 



^ U. -^2/, 



e=l \ 

dTi (e) 



“ c/Ms(e) 



^ dMs(e) J i<*<^^dMs(e) 



E' 



-d,Ti{e). 



The last equality holds because we can choose one tree Ti with smallest value 
of X)e 2/et/Ti(e)/dMs(e) and set its probability Xj = 1 to achieve the minimum. 
Denote by w{e) = ye/dMsi^) the weight on edge e. Therefore the goal of the 
block problem is to find a tree T connecting all vertices in Ms such that the 
value '^gW(e)dT(e) is minimized, for all edge e in Ms- This problem is in fact 
the minimum communication cost spanning tree (MCCST) problem. 

However, the MCCST problem is AfT^-hard [11, 14,21]. The best determin- 
istic polynomial time approximation algorithm is addressed in [3] and [4] in- 
dependently. And the approximation ratio is O(lognloglogn). The worst case 
stretch of any edge is bounded by 0{n). 

To solve (1), we have to call a solver of the MCCST problem as an ora- 
cle. Here it is worth noting that the approximation ratio of the block solver is 
c = O(lognloglogn), i.e., a weak block solver. As a result, the approximation 
ratio of the solution to (1) is also O (log n log log n). Hence many approximation 
algorithms for the packing problem are not applicable. The approximation algo- 
rithm for the packing problem in [18] was generalized to the case of weak block 
solver in [5]. The number of iterations is bounded by 0(e“^plog(me“^)) for any 
given error tolerance e G (0, 1], where p = maxe max^jg^ dTi{e)xi/dMsi^) 
in Subsection 1.2. In the problem of approximating metric spaces by tree metrics, 
p = 0{n). Hence totally at most 0(n£“^ log(me“^)) iterations are required to 
obtain an 0(lognloglogn)-approximate solution to (1) and in each iteration an 
0(log n log log n)-approximate MCCST solver is called once and one tree metric 
in T is generated and assigned a positive probability. Therefore when algorithm 
halts there are N = \T\ = 0{ne~^ log{me~^)) tree metrics. If the algorithm in 
[3] is applied, the number of trees in T will be exponentially large. 

Finally, in the second phase of the algorithm ACQs, approaches on tree met- 
rics in Section 2 are employed. Notice that the distance of tree metrics in T in 
fact corresponds to the cost of original Euclidean space S, and recall that the 
approximation ratio of the first phase is O(lognloglogn). With the property of 
the solution in the second phase, denoting by f3 the running time of the MCCST 
solver, we have the following theorems: 

Theorem 3. There exists a probabilistic O{lognloglogn)-approximation algo- 
rithm for the h-broadcast range assignment problem in static ad-hoc networks on 
d dimensional Euclidean spaces for d > 2, with a running time 0{hn'^ -\-n (3 log n) . 
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Theorem 4. There exists a probabilistic 0{n^ lognloglogn)-approximation al- 
gorithm for the h-strong connectivity range assignment problem in static ad-hoc 
networks on d dimensional Euclidean spaces for d > 2, with a running time 
0{hn^ -\- nfdlogn). 

The algorithm in [15,23] can be also applied here with the same approxima- 
tion ratio but a different bound on running time. 

3.1 Improved Algorithms for the Case of Small a 

We notice that in [5] a probabilistic 0{\/dlog n)-approximation algorithm for the 
problem of approximating metrics by tree metrics on d dimensional Euclidean 
spaces is addressed. In fact here we can apply this algorithm in the case of small 
a. Since in real applications, the problems are mostly defined on two or three 
dimensional Euclidean spaces. Therefore in this subsection we consider d as a 
constant. 

If the distance-power gradient a < 1 -I- 0(logloglogn/loglogn), then we do 
not define the complete graph Ms and in the linear program (1), dMsi^) is re- 
placed by ds{e). Afterwards we generate the collection T of trees to approximate 
S directly. In the first phase we apply the algorithm for real normed spaces in [5] 
as a weak block solver. And the the stretch of any edge in S is at most 0(log n). 
Thus in the first phase of the algorithm the cost increases by a factor at most 
0(log“n). We have 

log“n < = logn(logn)‘^i^°®*°®^°®"/*°®^°®"i 

= logn(logn)‘^i^°®'°«" = O (log n log log n). 

Thus the approximation ratio of the algorithm is 0(log“n), which is not more 
than O(lognloglogn). For this special case we have the following corollaries: 

Corollary 1. There exists a probabilistic Oi\og°‘ n)- approximation algorithm 
for the h-broadcast range assignment problem in static ad-hoc networks on d 
dimensional Euclidean spaces for d>2, with a running time 0{hn'^ -\- nfdlogn) , 
for a < 1 -I- 0(log log log n/ log log n) . 

Corollary 2. There exists a probabilistic 0{n^ log“ n)- approximation algorithm 
for the h-strong connectivity range assignment problem in static ad-hoc networks 
on d dimensional Euclidean spaces for d > 2, with a running time 0{hn^ -\- 
n(3 log n) , for a < 1 -I- O (log log log nj log log n) . 

In fact the case a < I -I- 0(log log log n/ log log n) does exist. If n = 256, then 
log log log n/ log log n = log3/3 « 0.53. Then 1 -|- log log log n/ log log n « 1.53 
We believe that this algorithm can find many real applications. 

4 Algorithms for Problems on Finite Metric Spaces 

We study the approximation algorithms for the range assignment problem in 
static ad-hoc networks on any finite metric spaces in this section. Assume that 
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G = (S', E) is a metric space defined on the set of stations S and E is the set 
of edges. The length function is defined hy Iq ■ E ^ U {0}. The idea to 
solve this problem is essentially the same as Section 3 with a different distance 
function va Ti & T , i = 1, . . . , N . Thus we develop the algorithm ACQg for the 
general metric spaces. 

Similarly, we focus on the first phase. We also define a complete graph Mq 
in advance, which is induced by S and the length of edge (u, v) is defined as 
Img{u,v) = {da{u,v))°‘ (here da{u,v) is the metric distance in G). And the 
shortest path distance between the pair u and v is c?Mg ^ ^Mg (u, v). Hence 
the cost that the station u can cover v is c{u,v) = dMci'^t'^) ^ {dG{u,v))°‘. 
Thus we obtain a similar linear program to (1) for the probability distribution. 
The only difference is that the first set of constraints are replaced by 

N 

''^dTi{e)xi < XdMc{^)j for any edge e. (2) 

i=l 

Then we can also use the approximation algorithm in [5] to solve this linear 
program. Denoting by weight function w(e) = ye/ dMc{d)^ the block problem is 
also the MCCST problem but with a different weight function. The algorithms 
in [3,4] are applied as an 0(lognloglogn)-approximate block solver. With the 
similar arguments in Section 3 we have the following theorems: 

Theorem 5. There exists a probabilistic O{lognloglogn)-approximation algo- 
rithm for the h-broadcast range assignment problem in static ad-hoc networks on 
finite metric spaces with a running time 0{hn'^ + n/31ogn). 

Theorem 6. There exists a probabilistic 0{n^ log nloglogn)- approximation al- 
gorithm for the h-strong connectivity range assignment problem in static ad-hoc 
networks on finite metric spaces with a running time 0{hn^ + n(3\ogn). 

The algorithm in [23] can be also applied here with a number of iterations 
bounded by 0(n^(logn + log£“^)). We notice that in [19] a lower bound 

0(log n) for the /i-broadcast problem on general metric spaces is proved, which 
only differs from our result by a factor of O(loglogn). 

4.1 Better Algorithms for the Case of Small a 
and Real Normed Spaces 

We can still use the 0(log n)-approximation algorithm in [5] for real normed 
spaces here to generate the collection T as the tree metrics are used to approx- 
imate the original space G. In fact the algorithm is employed here for the case 
of Of = 1 -I- 0(logloglogn/loglogn), similar to Subsection 3.1. With the same 
arguments the following corollaries hold: 

Corollary 3. If a < 1-1- 0(logloglogn/loglogn), then there exists a proba- 
bilistic Oi\og°‘ n)- approximation algorithm for the h-broadcast range assignment 
problem in static ad-hoc networks on real normed spaces with a running time 
0{hn‘^ -\- npiogn). 
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Corollary 4. If a < l-|-0(logloglogn/loglogn), then there exists a probabilis- 
tic 0{n^ log“ n)- approximation algorithm for the h-strong connectivity range as- 
signment problem in static ad-hoc networks on real normed spaces with a running 
time 0{hn^ -\- n/31ogn). 

5 Concluding Remarks 

In this paper we have studied the range assignment problem in static multi-hop 
ad-hoc networks on trees, high dimensional Euclidean spaces and general finite 
metric spaces. We proposed (approximation) algorithms for these problem in 
both the homogeneous and the non-homogeneous cases. And the approximation 
ratio O(lognloglogn) for the /i-broadcast problem on general metric spaces is 
close to the known lower bound O(logn) [19]. We also presented improved ap- 
proximation algorithms for both problems when a (the distance-power gradient) 
is small. There are some interesting open problems: 

1. Is the /i-strong connectivity problem on one dimensional spaces polynomial 
time solvable? 

2. Can we obtain an 0(log n)-approximation algorithm for the /i-broadcast 
problem on general metric spaces to reach the lower bound? 

3. Can we improve the result of the /i-strong connectivity problem such that 
the factor 0{n^) in the approximation ratio is eliminated? 
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